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HEPAHnS C VIRUS THCV^ POLYPEPTIDES 



10 Technical Meld 

Hie invention relates to materials and methodologies for managing 
the spread of hepatitis C virus (HCV) infection. More qjecifically, it relates to 
polypeptides useful as immunological reagents in the detection, prevention and 
treatment of HCV infections. 




HCV was first identified and characterized as a cause of non-A, 
non-B hepatitis (NANBH) by Houghton et al. This led to the disclosure of a 
number of general and specific polypeptides useful as immunological reagents. 

20 See. e.g., Houghton et al., EPO Pub. No. 318,216; Houghton et al., EPO Pub. 
No. 388,232; Choo et al., Science (1989) 244:359-362; Kuo et al.. Science (1989) 
244:362-364; Houghton et al., Hepatolopy (1991) 14:381-388. These 
publications provide the art with an extensive background on HCV generally, as 
weU as the manufacture and uses of HCV polypeptide immunological reagents. 

25 For brevity, therefore, the disclosure of these publications in particular are 
incorporated herein by reference. 

Others have readily applied and extended the work of Houghton et 
al. See, e.g.. Highfield et al., UK Pat. App. 2,239,245 (The WeUcome 
Foundation Ltd.); Wang, EPO Pub. No. 442,394 (United Biomedical Inc.); Leung 

30 et al., EPO Pub. No. 445,423 (Abbott Laboratories); Habits et aL, EPO Pub. No. 
451,891 (Akzo N.V.); Reyes et al., PCT Pub. No. WO 91/15516 (Genelabs Inc.); 
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Maid et aL, EPO Pub. No. 468,657 (Tonen Corp.); and Kamada et al., EPO Pub. 
Na. 469,348 (Shianogi Seiyaku K.K.). 

Sensitive, specific methods for screening and identifying carriers of 
HCV and HCV-contaminated blood or blood products are an important advance in 
5 medicine. Post-transfusion hepatitis (PIH) occurs in approximately 10% of 
transfused patients, and HCV has accounted for up to 90% of these cases. The 
major problem in this disease is the frequent progression to chronic liver damage 
(25-55 %). Patient care as well as the prevention of transmission of HCV by 
blood and blood products or by close personal contact require reliable diagnostic 

10 and prognostic tools, such as HCV polypeptides, to detect antibodies related to 
HCV, Such polypeptides are also usefiil as vaccines and immunotherapeutic 
therapeutic agents for the prevention and/or treatment of the disease. 

Since HCV is a relatively new agent, a contiouing need exists to 
define additional immunological reagents that will allow further study of the 

15 clinical course of disease and the epidemiology of HCV in the population. 

nisnlnsiire nf the Invention 

The invention pertains to the characterization of new HCV epitopes. 
The diaracterization of these qntopes permits the manufacture of polypeptide 

20 products which reacted immunologically with antibodies to HCV and/or generate 
anti-HCV antibody production in vivo. These polypeptide products are useful as 
standards or reagents in diagnostic tests and/or as components of vacciaes. Anti- 
bodies, including for example botii polyclonal and monoclonal, directed against 
HCV epitopes contained within these polypeptide sequences are also useful 

25 reagents, for example, in diagnostic tests, as therapeutic agents, for screening of 
antiviral agents, and for the isolation of isolation/purification of HCV polypeptides 
or particles. 

In its broadest sense, the present invention is directed to 
polypeptides containing the newly characterized HCV epitopes disclosed herein, 
30 methods of manufacturing such polypeptides (e.g., recombinant and synthetic 
methods), methods of using such polypeptides (e.g., diagnostic, vaccme, and 
therapeutic), and articles of manufacture, compositions or formulations adapted to 
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such uses (e.g., polypeptides ixxed to an immunoassay or other support, oral or 
injectable pharmaceutical compositions). Similarly, antibodies (polyclonal, 
monoclonal, or equivalents such as binding fragmrats, single-chain antigen-binding 
proteins, etc.) to the HCV epitopes disclosed herein are also included within the 
scope of the present invention, as well as methods of making such antibodies, 
methods of using such antibodies (e.g., diagnostic, vaccine, and therapeutic), and 
articles of manu&ctare, compositions or formulations adapted to such uses (e.g., 
antibodies fixed to an immunoassay or other sty>port, oral or injectable 
pharmaceutical compositions). 

Other aspects of the invention pertain to kits for analyzing samples 
for the presence of an HCV antigen comprising the above antibodies in a suitable 
container. Still other aspects of the invraition pertain to kits for analyzing samples 
for the presence of an antibodies directed against an HCV antigen comprising a 
polypeptide as described above in a suitable container. 

Still other aspects of the invention are: a method for producing a 
polypeptide containing the newly disclosed HCV q)itopes comprising incubating 
host cells transformed with an expression vector containing a sequence encoding a 
polypeptide containing the HCV qpitope under conditions which allow ejcpression 
of said polypeptide; and a polypeptide containing such an HCV epitope produced 
by this method. 

Immunoassays are also included in the invention. These inchide an 
immunoassay for detecting an HCV antigen comprising incubatmg a sample 
suspected of containing an HCV antigen with an antibody as described above 
under conditions wbidi allow the formation of an antigen-antibody complex; and 
detecting an antigen-antibody complex containing the antibody. An immunoassay 
for detecting anti-HCV antibodies comprising incubating a sample suspected of 
containing anti-HCV antibodies with a polypqitide as described above, under 
conditions which allow the formation of an antibody-antigen complex; and 
detecting tiie antibody-antigen complex containing the polypqptide. 

Also included in the invention are vaccines for treataient of HCV 
infection comprising an immunogenic peptide containing an HCV epitope 
described herem. 
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4 

Yet another aspect of the invention is a method for produdiig 
antibodies to HCV comprising admimstering to an individual an isolated 
immunogenic polypeptide containing an HCV epitope described bessan in an 
amount sufficient to produce an immune response. 
5 The above aspects of the present invention are accompli^ed by the 

discovery of HCV epitopes of the formula 

wherein aa denotes an amino acid; 
X and y are integers such tiiat y-x ^ 6; 
10 aa^-aay indicates a portion of the amino acid sequence of Figure 1; 

and 

X is selected fixjm the group consisting of 23-34, 36, 66-79, 81-94, 
96-98, 101-103, 186-189, 191, 206, 223, 232, 256, 286, 297-299, 321, 347, 357, 
413, 414, 432, 465-471, 480-484, 501, 502, 521, 540-549, 579, 594-599, 601- 

15 613, 641, 662-665, 685, 705, 706, 729, 782-789, 801, 851-855, 893, 916, 928, 
946, 952-954, 1026, 1072, 1109, 1112-1117, 1218, 1240, 1280-1285, 1322, 1338, 
1371, 1384, 1410, 1411, 1454, 1492, 1493, 1532-1535, 1560, 1561, 1566-1568, 
1571-1577, 1601-1607, 1615-1620, 1655, 1695, 1710-1712, 1728, 1729, 1758- 
1762, 1781, 1808, 1821, 1851, 1880, 1908-1913, 1925, 1940-1948, 1951, 1966- 

20 1969, 1999, 2001-2004, 2006-2014, 2024, 2048-2053, 2055-2057, 2071, 2088- 
2093. 2108, 2122-2148, 2165, 2187, 2226-2232, 2244-2249, 2267, 2281-2286, 
2288, 2289, 2325-2327, 2346, 2347, 2349, 2382, 2401, 2417-2422, 2439-2444, 
2446-2456, 2469, 2471-2476, 2495, 2533, 2534, 2573-2578, 2602-2604, 2606- 
2612, 2632-2638, 2660, 2676-2679, 2688-2693, 2707, 2721, 2757-2762, 2779, 

25 2794, 2795, 2797-2799, 2801, 2802, 2817-2843, 2863-2867, 2878-2884, 2886- 
2895. 
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Hie above objects are also achieved using HCV epitopes of the 

formula 

aa,-aay 

wherein aa denotes an amino acid; 
5 X and y are integers such that y-x ^ 6; 

aax-aay indicates a portion of the amino acid sequence of Figure 1; 
and X is select from the group consisting of 35 (where y is less than 

45), 80 (where y is less than 90), 95 (where y is less than 110), 99 (where y is 
less than 120), 100 (where y is less than 150), 190 (where y is less than 210), 500 
10 (where y is less than 550), 600 (where y is less than 625), 1260 (where y is less 
than 1280), 1569 (where y is less than 1931), 1570 (where y is less than 1590), 
1694 (where y is less than 1735), 1949 (where y is less than 2124), 1950 (where y 
is less than 1985), 2000 (where y is less than 2050), 2005 (where y is less than 
2025), 2054 (where y is less than 2223), 2250 (where y is less than 2330), 2287 
15 (where y is less than 2385), 2290 (where y is less than 2310), 2345 (where y is 
less than 2375), 2348 (where y is less than 2464), 2445 (where y is less than 
2475), 2470 (where y is less than 2490), 2605 (where y is less than 2620), 2780 
(where y is less than 2830), 2796 (where y is less than 2886), 2800 (where y is 
less than 2850), and 2885 (where y is less than 2905). 
20 In either of the above formula, x-y can less than or equal to 10, 20, 

30, 40 or 50 in some embodiments of the invention. 

Brief Description n f the Drawings 

Fig. 1 shows the polyprotein of the HCV prototype isolate HCVl. 
25 Fig. 2 shows a composite cDNA sequence for HCVl. 

Fig. 3 shows the nucleotide consraisus sequence of human isolate 
23, variant sequraices are shown below the sequence line. The amino acids 
encoded in the consensus sequence are also shown. 

Fig. 4 shows the nucleotide consensus sequence of human isolate 
30 27, variant sequences arc shown below the sequence line. The amino acids 
encoded in the consensus sequence are also shown. 
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Fig. 5 shows the aligned nucleotide sequ»ices of human isolates 23 
and 27 and of HCVl. Homologous sequences are indicated by the symbol (*). 
Non-homologous sequences are in small letters. 

Fig. 6 shows the aligned amino add sequences of human isolates 23 
5 and 27 and of HCVl. Homologous sequences are indicated by the symbol (*). 
Non-homologous sequences are in small letters. 

Fig. 7 shows a comparison of the composite aligned nucleotide 
sequences of isolates Thorn, ECl, HCT #18, and HCVl. 

Fig. 8 shows a comparison of the nucleotide sequences of EClO and 
10 a composite of the HCVl sequence; the EC 10 sequence is on the line above the 
dots, and the HCVl sequence is on the line below the dots. 

Kg. 9 shows a comparison of the amino acid sequences 117-308 
(relative to HCVl) encoded in the "EnvL" regions of the consensus sequences of 
human isolates HCT #18, JH23, JH 27, Thome, ECl, and of HCVl. 
15 Fig. 10 shows a comparison of the amino acid sequences 330-360 

(relative to HCVl) encoded in the "BivR" regions of the consensus sequences of 
human isolates HCT #18, JH23, JH 27, Thome, ECl, and of HCVl. 

Modes for Carrying Out the Invention 

20 

Complete citations to publications referred to herein can be found in 
the "Background" or "Bibliography" sections. 
I. Definitions 

"Hqmtitis C vims" or "HCV" refers to tiie art-iecognized viral 
25 species of which pathogaiic strains cause NANBH, and attenuated strains or 

defective interfering particles derived there&om. See generally, publications cited 
in the section entitied "Background." The HCV genome is comprised of BNA. It 
is known that RNA contaming viroses have relatively high rates of spontaneous 
mutation, i.e., reportedly on the order of 10"^ to 10^ per incorporated nucleotide 
30 (Fields & Kmpe (1986)). Therefore, since heterogeneity and fluidity of genotype 
are inherent in RNA viruses, there are multiple strains/isolates, which may be 
virulent or avirulent, within flie HCV species. The propagation, identification. 
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detection, and isolation of ttio various HCV strains or isdates has been well 
docum^ited in tiie literature. Moreover, tlie disdosoie lierein allows the 
preparation of (tiagnostics and vaccines for the various strains/isolates, as well as 
compositions and methods that have utilify in screening procedures for anti-viral 
agents for pharmacologic use, such as ag^ that inhibit replication of HCV. 

Infionnation on several differrait strains/isolates of HCV is disclosed 
herdn, particularly stram or isolate CDC/HCVl (also called HCVl). Moimation 
from one strain or isolate, such as a partial genomic or amino acid sequence, is 
sufficient to allow those skilled in the art usmg standard techniques to isolate new 
strains/isolates and to identify whether such new strains/isolates are HCV. For 
example, several different strams/isolates are described below. These strains, 
which were obtained from a number of human sera (and from different 
geographical areas), were isolated utilizing the information from the genomic 
sequence of HCVl. 

The information provided herein is indicative that HCV may be 
distantly related to the flaviviridae. The Havivirus family contains a large number 
of viruses which are small, enveloped pathogens of man. The morphology and 
composition of Flavivirus particles are known, and are discussed in Brinton 
(1986). Generally, with respect to moiphology, Flaviviruses contain a central 
nucleocapsid surrounded by a lipid bilayer. Virions are spherical and have a 
diameter of about 40-50 nm. HiMr cores are about 25-30 nm in diameter. Along 
the outer surface of the virion envelope are projections that are about 5-10 nm 
long with terminal knobs about 2 nm in diameter. Typical examples of the £smily 
include Ydlow Fever virus, West Nile virus, and Dengue Fever virus. They 
possess positive-stranded RNA genomes (~ 11,000 nucleotides) that are slightly 
larger than that of HCV and encode a polyprotein precursor of about 3500 amino 
adds. Individual vind proteins are cloved from this precursor polypeptide. 

The genomic structure and the nucleotide sequence of HCV genomic 
RNA has been deduced. The genome appears to be single-stranded RNA con- 
taining ~ 10,000 nucleotides. The genome is positive-stranded, and possesses a 
continuous, translational open reading frame (ORF) that encodes a polyprotein of 
about 3,000 amino acids. In the ORF, the structural protein(s) appear to be 
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8 



encoded in appixjximately tbe first quarter of the N-tenninus region, with the 
majority of the polyprotdn responsible for non-structural proteins. When 
compared wifli all known viral sequences, small but significant co-linear 
homologies are observed with the non-structural proteins of the flavivirus family, 
5 and with the pestiviruses (which are now also considered to be part of the 
Havivirus family). 



sequence of HCVl and other evidence, possible protein domains of flie encoded 
HCV polypiotein, as well as the approximate boundaries, are the following: 



Based upon the putative amino acids encoded in the nucleotide 



10 



Putative Domain 




famiTifi add nos.) 



C (nucleocapsid protein) 



1-191 



15 



B, (virion envelope protein) 



192-383 



Ej/NSl (envelope?) 



384-800 



20 



NS2 (unknown function) 



800-1050 



NS3 (protease?) 



1050-1650 



25 



NS4 (unknown function) 



1651-2100 



NS5 (polymerase) 



2100-3011(end) 
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These domains aie, however, tentative. For example, the E1-NS2 border is 
probably in the 750-810 region, and NS3-NS4 border is about 1640-1650. There 
is also evidence that tiie 191 aa version of C is a precinsor that is further 
5 processed (e.g. , to about 170 aa in length), and that the NS2, NS4 and NS5 
protems are eadi further processed into two mature proteins. 

Different strains, isolates or subtypes of HCV are expected to 
contain variations at the amino add and nucleic acids compared with HCVl. 
Many isolates are expected to show much (i.e., more than about 40%) homology 

10 in the total amino acid sequence compared with HCVl. However, it may also be 
found that there are other less homologous HCV isolates. These would be defined 
as HCV according to various criteria such as, for example, an ORF of 
approximately 9,000 nucleotides to approximately 12,000 nucleotides, encoding a 
polyprotein similar in size to that of HCVl, an encoded polyprotein of sunikr 

15 hydrophobic and/or antigenic character to that of HCVl, and the presence of co- 
linear peptide sequences that are conserved with HCVl. In addition, the genome 
would be a positive-stranded RNA. 

HCV encodes at least one qpitqpe which is immunologically 
identifiable with an epitope in the HCVl polyprot^. The epitope is unique to 

20 HCV when compared to previously known Haviviruses. The uniqueness of the 
epitope may be determined by its immunological reactivity with anti-HCV anti- 
bodies and lack of immunological reactivity with antibodies to known Fkvivirus 
speck&. Methods for determining immunological reactivity are known in the art, 
for example, by radioimmunoassay, by EUSA assay, by hemagglutination, and 

25 several examples of suitable techniques for assays are provided herein. 

Alternatively, a comparison of the sequence of the HCV epitope to previously 
known sequences of members of the Flavivirus family can be used to evaluate 
"uniqueness." 

In addition to the above, the following parameters of nucleic acid 
30 homology and amino acid homology are applicable, either alone or in 

combination, in identifying a strain/isolate as HCV. Since HCV strains and 
isolates are evolutionarily related, it is expected that the overall homology of the 
genomes at the nucleotide level may be about 10% or greater, probably will be 
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about 40% or greater, probably about 60% or greater, and even more probably 
about 80% or greater; and in addition that there will be corresponding contiguous 
sequences of at least about 13 nucleotides. It should be noted that variable and 
hypervariable regions within the HCV genome; dierefore, the homology in these 
5 regions is expected to be significantly less than that in the overall genome. The 
correspondence between the putative HCV strain genomic sequence and, for 
example, the CDC/HCVl cDNA sequence can be determined by techniques known 
in the art. For example, they can be determined by a direct comparison of the 
sequence infonnation of the polynucleotide from the putative HCV, and the HCV 

10 cDNA sequence(s) described herein. For example, also, they can be determined 
by hybridization of the polynucleotides under conditions which form stable 
duplexes between homologous regions (for example, those which would be used 
prior to Si digestion), followed by digestion with single stranded specific 
Quclease(s), followed by size determination of the digested fragments. 

15 Because of the evolutionary relationship of the strains or isolates of 

HCV, putative HCV strains or isolates are identifiable by their homology at the 
polypq>fide level. Generally, HCV strains or isolates are expected to be at least 
10% homologous, more than about 40% homologous, probably more tiian about 
70% homologous, and even more probably more than about 80% homologous, and 

20 some may even be more than about 90% homologous at the polypeptide level. 
The techniques for determining amino add sequence homology are known in the 
art. For example, the amino add sequence may be detmniaed directly and 
compared to the sequences provided herem. Altematively the nucleotide sequ^e 
of the genomic material of the putative HCV may be determined (usually via a 

25 cDNA intermediate), the amino acid sequence encoded therein can be determined, 
and the corresponding regions compared. 

As used herein, a polynucleotide "derived from" a designated 
sequence refers to a polynucleotide sequence which is comprised of a sequence of 
approximately at least about 6 nucleotides, preferably at least about 8 nucleotides, 

30 more preferably at least about 10-12 nucleotides, and even more preferably at least 
about 15-20 nucleotides corresponding to a region of the designated nucleotide 
sequence. "Corresponding" means homologous to or complementary to the 
designated sequence. Prrferably, the sequence of the region ftom which the 
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polynucleotide is derived is homologous to or complementary to a sequence which 
is unique to an HCV genome. Whether or not a sequence is unique to the HCV 
graome can be determined by techniques known to those of skill in the art. For 
example, the sequence can be compared to sequences in databanks (as of the 
5 priority date), eg. , Geneibank, to determine whether it is present in the umnfected 
host or other organisms. The sequence can also be compared to the known (as of 
the priority date) sequences of other viral agents, including those which are known 
to induce hepatitis, e.g. , HAV, HBV, and HDV, and to members of the 
Haviviridae. The correspondence or non-correspondence of the derived sequence 
10 to other sequences can also be determined by hybridization under the appropriate 
stringency conditions. Hybridization techniques for determining the 
complementarity of nucleic acid sequences are known in the art. See, for 
example, Maniatis et al. (1982). In addition, mismatches of duplex poly- 
nucleotides formed by hybridization can be determined by known techniques, 
15 including for example, digestion with a nuclease such as SI that specifically 
digests single-stranded areas in duplex polynucleotides. Regions firom which 
typical DNA sequences may be "derived" include but are not limited to, for 
example, regions encoding specific epitopes, as well as non-transcribed and/or 
non-translated r^ons. 

The derived polynucleotide is not necessarily physically derived 
from the nucleotide sequence shown, but may be generated in any manner, 
including for example, chemical synthesis or DNA replication or reverse 
transcription or transcription. In addition, combmations of regions corre^nding 
to that of the designated sequence may be modified in ways known in the art to be 
consistent with an intended use. 

Sunilarly, a polypeptide or amino acid sequence "derived from" a 
designated amino acid or nucleic add sequence refeis to a polypeptide having an 
amino acid sequence identical to that of a polypeptide encoded in the sequence, or 
a portion thereof wherein the portion consists of at least 3-5 amino acids, and 
more preferably at least 8-10 amino acids, and even more preferably at least 11-15 
amino acids, or which is immunologically identifiable with a polypeptide encoded 
in the sequence. This terminology also includes a polypeptide expressed from a 
designated nucleic acid sequence. 
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A recombinant or derived polypeptide is not necessarily translated 
ftom a designated nucleic acid sequence; it may be generated in any manner, 
including for example, chemical synthesis, or expression of a recombinant 
expression system, or isolation from HCV, including mutated HCV. A recombin- 
5 ant or derived polypeptide may include one or more analogs of amino acids or 
unnatural amino acids in its sequence. Methods of inserting analogs of amino 
acids into a sequence are known in the art. It also may include one or more 
labels, which are known to those of skill in the art. 

The term "recombinant polynucleotide" as used herein intends a 
10 polynucleotide of genomic, cDNA, semisynthetic, or synthetic origm which, by 
virtue of its origin or manipulation: (1) is not associated with all or a portion of a 
polynucleotide with which it is associated in nature, (2) is Imked to a 
polynucleotide other than that to which it is linked in nature, or (3) does not occur 
in nature. 

15 The term "polynudeotide" as used herdn refers to a polymeric form 

of nucleotides of any length, either ribonucleotides or deoxyribonudeotides. This 
term refers only to the pdmaiy structure of the molecule. Tbais, this tern 
includes double- and single-stranded DNA and RNA. It also inchides known types 
of modifications, for example, labels which are known in the art, mefihylation, 

20 "caps", substitution of one or more of flie naturally occurring nucleotides wifli an 
analog, intomucleotide modifications such as, for example, those with uncharged 
linkages ie.g., methyl phosphonates, phosphotriesters, phosphoramidates, 
carbamates, etc.) and with charged linkages (e.g., phosphorothioates, phosphoio- 
dithioates, etc.), those containing pendant moieties, such as, for example proteins 

25 (includmg for e.g., nucleases, toxms, antibodies, signal peptides, poly-L-lysine, 
etc.), those with intercalators (e.g., acridine, psoralen, etc.), those containing 
chelators (e.g., metals, radioactive metals, boron, oxidative metals, etc.), those 
containing alkylators, those with modified linkages (e.g. , alpha anomeric nucleic 
acids, etc.), as wdl as unmodified forms of the polynucleotide. 

30 A "puiffied" polypeptide refers to flie polypeptide bemg m a state 

that is substantially free of otiier polypqrtides, i.e. , in a composition that contains 
a Tninimnm of about 50% by wdght (desired polypq)tide/total polypqptide in 
composition), preferably a minimum of about 70%, and even more preferably a 
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minimum of about 90% of the desiied polypq}tide, without regaid to non- 
piDteinaceous materials in tiie composition. Techniques for puriiying vixal poly- 
peptides aie known in the art. Purified antiiiodies are similariy defined. 

"Recombinant host cdQs", "host cells", "cells", "celllines", "cell 
5 cultures", and odier such tenns denoting miciooiganisms or higher eukaiyotic cell 
lines cultured as unicellular entities refer to cells which can be, or have been, used 
as recipients for recombhiant vector or other transfer DNA, and include the 
progeny of the original ceU which has been transfected. It is understood that the 
progeny of a single parental cell may not necessarily be completely identical in 

10 morphology or in genomic or total DNA complement as the original parent, due to 
natural, accidental, or deliberate mutation. 

A "replicon" is any genetic element, e.g., aplasmid, a 
chromosome, a virus, a cosmid, etc. that behaves as an autonomous unit of 
polynucleotide replication within a cell; i.e., capable of rqjlication under its own 

15 control. 

A "vector" is a replicon in which another polynucleotide s^ent is 
attached, so as to bring about the replication and/or expression of the attached 
segment. 

"Control sequence" refers to polynucleotide sequences which are 
20 necessary to effect the ejcpression of coding sequences to which they are ligated. 
Hie nature of such control sequences differs depending upon the host organism; in 
prokaryotes, such control sequences generally include promoter, ribosomal binding 
site, and terminators; in eukaryotes, graierally, such control sequences inchide 
promoters, terminators and, in some instances, ^ihancers. The term "control 
25 sequences" is intended to include, at a minimum, all components whose presence 
is necessary for e:xpression, and may also include additional components whose 
pres^ice is advantageous, for example, leader sequences. 

"durably linked" refere to a juxtaposition wherein the components 
so described are in a relationship permitting them to function in their intended 
30 manner. A control sequence "operably linked" to a coding sequence is ligated in 
such a way that expression of the coding sequence is achieved under conditions 
compatible with the control sequences. 
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An "open reading frame" (OKF) is a region of a polynucleotide 
sequence which encodes a polypeptide; tibis region may iqjiesent a portion of a 
coding sequence or a total coding sequence. 

A "coding sequence" is a polynucleotide sequence which is 
5 transcribed into mRNA and/or translated into a polypeptide when placed under the 
control of appropriate regulatory sequences. The boundaries of the codmg 
sequence are determined by a translation start codon at the 5'-terminus and a 
translation stop codon at the S'-terminus. A codmg sequence can include, but is 
not Dmited to mENA, cDNA, and recombinant polynucleotide sequences. 

10 "Immunologically identifiable with/as" refers to the presence of 

epitope(s) and polypeptides(s) which are also present in the designated 
polypeptide(s), usually HCV proteins. Dnmunological identity may be detennmed 
by antibody binding and/or competition in binding; these techniques are known to 
those of average skill in the art. 

15 As used herein, "epitope" refers to an antigenic determinant of a 

polypq)tide. An epitope could comprise 3 or more amino adds that defines the 
binding site of an antibody. Generally an epitope consists of at least 5 amino 
acids, and sometimes consists of at least 8 amino acids. Methods of qntope 
mapping are known in the art. 

20 A polypeptide is "immunologically reactive" with an antibody when 

it binds to an antibody due to antibody recognition of a specific epitope contained 
within the polypeptide. Immunological reactivity may be determined by antibody 
bindmg, more particularly by tiie kinetics of antibody binding, and/or by competi- 
tion in bmding usmg as competitor(s) a known polypeptide(s) containing an epitope 

25 against which the antibody is directed. The techniques for determining whether a 
polypeptide is immunologically reactive with an antibody are known in the art. 

As used herein, the term "antibody" refers to a polypeptide or group 
of polypeptides which are comprised of at least one antibody combining site. An 
"antibody combining site" or "binding domain" is formed from the folding of vari- 

30 able domains of an antibody molecule(s) to form tiiree-dimensional binding spaces 
with an internal surface shape and charge distribution complementary to the 
features of an epitope of an antigen, which allows an immunological reaction with 
the antigen. An antibody combining site may be formed from a heavy and/or a 
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Kght chain domain (Vh and Vl, respectively), which fonn hypervariable loops 
which contribute to antigen binding. The tenn "antibody" includes, for exaniple, 
vertebrate antibodies, hybrid antibodies, chimeric antibodies, altered antibodies, 
univalent antibodies, ti^ Fab piotnns, and single domain antibodies. 
5 As used heidn, a "smgte domain antibody" (dAb) is an antibody 

which is comprised of an VH domain, which reacts immunologically with a 
designated antigen. A dAb does not contain a Vj, domain, but may contain other 
antigen binding domains known to exist in antibodies, for example, the kappa and 
lambda domains. Methods for preparing dAbs are known in the art. See, for 
10 example, Ward et al (1989). 

Antibodies may also be comprised of Vh and Vl domains, as well as 
other known antigen binding domains. Examples of these types of antibodies and 
methods for their preparation are known in the art (see, e.g. , U.S. Patent No. 
4,816,467, which is incoiporated herein by reference), and include the following. 
15 For example, "vertebrate antibodies" refers to antibodies which are trtramers or 
aggregates thereof, comprising light and heavy chains which are usually 
aggregated in a "Y" configuration and which may or may not have covalent link- 
ages between the chains. In vertebrate antibodies, the amino add sequraces of all 
the chains of a particular antibody are homologous with the chains found in one 
20 antibody produced by the lymphocyte which produces that antibody in situ, or in 
vitro (for example, in hybridomas). Vertebrate antibodies typically inchide native 
antibodies, for example, purified polyclonal antibodies and monoclonal antibodies. 
Examples of die methods for tiie preparation of these antibodies are described 

25 "Hybrid antibodies" are antibodies wherein one pair of heavy and 

light chains is homologous to those in a first antibody, while the otiier pair of 
heavy and light chains is homologous to those in a different second antibody. 
Typically, each of these two pairs will bind different epitopes, particularly on 
different antigens. This resuks in the property of "divalence", i.e., the ability to 

30 bind two antigens simultaneously. Such hybrids may also be formed using 
chimeric chains, as set forth below. 

"Chimeric antibodies", are antibodies in which the heavy and/or 
light chains are fusion proteins. Typically the constant domain of the chains is 
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ftom one paiticular species and/or class, and the variable domains axe ficom a 
different species and/or class. Also included is any antibody in wliich dther or 
both of the heavy or light chains are composed of combinations of sequences 
mimicking the sequences in antibodies of different sources, whether these sources 
5 be differing classes, or different species of origin, and whether or not the fusion 
point is at the variable/constant boundary. Thus, it is possible to produce 
antibodies in which neither the constant nor the variable region mimic known anti- 
body sequences. It then becomes possible, for example, to construct antibodies 
whose variable region has a higher specific affinity for a particular antigen, or 
10 whose constant region can elicit enhanced complement fixation, or to make other 
improvements in properties possessed by a particular constant region. 

Another example is "altered antibodies", which refers to antibodies 
in which the naturally occurring amino acid sequence in a vertebrate antibody has 
been varied. Utilizing recombinant DNA techniques, antibodies can be redesigned 
15 to obtain desired characteristics. The possible variations are many, and range 
from the changing of one or more amino adds to the complete sedesign of a 
i^on, for exampls, the constant region. Changes in the constant region, in 
general, to attain desired cellular process characteristics, e.g., changes in com- 
plement fixation, interaction with membranes, and other effector functions. 
20 Changes in the variable region may be made to alter antigen binding 

characteristics. The antibody may also be engiaeeced to aid the specific delivery 
of a molecole or substance to a specific cell or tissue site. The desired alterations 
may be made by known techniques in molecular biology, e.g., recombinant 
techniques, site directed mutagenesis, etc. 
25 Yet another example ate "univalent antitiodies", which are 

aggregates comprised of a heavy chain/light chain dimer bound to the Fc (i.e. , 
constant) region of a second heavy chain. This type of antibody escapes antigenic 
modulation. See, e.g., GlennieefaZ. (1982). 

Included also within the definition of antibodies are "Fab" fragments 
30 of antibodies. The "Fab" region refers to those portions of the heavy and light 
chains which are roughly equivalent, or analogous, to the sequences which 
comprise the branch portion of the heavy and li^t chains, and which have been 
shown to exbSbit immunological binding to a specified antigen, but which lack the 
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effector Fc poitiaa. "Fab" includes aggregates of one heavy and one light chain 
(commonly known as Fab'), as well as t^iamers containing the 2H and 2L chains 
(lefened to as F(ab)2), which are capable of selectively reacting with a designated 
antigen or antigen fiimily. "Fab" antibodies may be divided into subsets analogous 
5 to those described above, i.e, "vertebrate Fab", "hybrid Fab", "chimeric Fab", 
and "altered Fab". Methods of producing "Fab" ftagments of antibodies are 
known within the art and include, for example, proteolysis, and synthesis by 
recombinant techniques. 

Also included in the term "antibodies" are single-chain antigen- 
ic binding (SCA) protdns, such as the type described in the article co-authored by 
Schlom, J. in the June IS, 1992 issue of Cancer Research (as well as articles cited 
therein). 

As used herein, the term "immunogenic polypeptide" is a 
polypeptide that elicits a cellular and/or humoral immune response, whether alone 

15 or linked to a carrier in the presence or absence of an adjuvant. 

Hie term "polypq)tide" refers to a polymer of amino acids and does 
not refer to a specific length of the product; thus, pqrtides, oligopeptides, and 
proteins are included within the definition of polypeptide. This term also does not 
refer to or exdude post-e^^ression modifications of the polypqptide, for example, 

20 glycosylation, acetylation, phosphorylation and the like. Included within the 
definition are, for example, polypeptides containing one or more analogs of an 
amino add (induding, for example, unnatural amino acids, £/c.), polypq>tides 
with substituted linkages, as weU as other modifications known in the art, both 
naturally occurring and non-naturally occurring. 

25 "Transformation", as used herein, refers to the insertion of ah 

exogenous polynucleotide into a host cell, irrespective of the method used for the 
insertion, for example, direct uptake, transduction, f-mating or electroporation. 
The exogenous polynucleotide may be mainlined as a non-integrated vector, for 
example, a plasmid, or alternatively, may be integrated into the host genome. 

30 "Treatment" as used herein refers to prophylaxis and/or therapy. 

An "individual", as used herein, refers to vertebrates, particularly 
members of the mammalian species, and includes but is not limited to aninml; ! 
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(e.g., dogs, cats, cattle, swine, sheep, goat, rabbits, mice, rats, guinea pigs, etc.), 
and primates, including monkeys, chimps, baboons and humans. 

As used herein, the "sense strand" of a nucldc acid contains the 
sequence tiiat has sequence homology to that of mRNA. The "anti-sense strand" 
5 contains a sequence which is complementary to that of the "sense strand". 

As used herein, a "positive stranded genome" of a virus is one in 
which the genome, whether RNA or DNA, is single-stranded and which encodes a 
viral polypeptide(s). Examples of positive stranded RNA viruses include 
Togaviridae, Coronaviridae, Retroviridae, Picomaviridae, and CaHciviridae. 
10 Included also, are the Flaviviridae, which were formerly classified as Togaviradae. 
See Fields & Knipe (1986). 

As used herein, "antibody containing body component" refers to a 
componoat of an individual's body which is a source of the antibodies of interest. 
Antibody containing body components are known in the art, and include but are 
15 not limited to, for example, plasma, serwn, spinal fluid, lymph fluid, the external 
sections of tiie respiratory, intestinal, and genitourinary tracts, tears, saliva, milk, 
white blood cells, and myelomas. 

As used herein, a "biological sample" refers to a sample of tissue or 
fluid isolated from an individual, including but not limited to, for example, 
20 plasma, serum, spinal fluid, lymph fluid, the external sections of the skin, respir- 
atory, intestinal, and geoitourinary tracts, tears, saliva, milk, blood cells, tumors, 
organs, and also samples of in vitro cell culture constituents (including but not 
limited to conditioned medium resulting fvam the growth of cells in cell culture 
medium, {nitatively virally infected cells, recombinant cells, and cell components). 

25 

n. Description of the Invention 

The practice of the present invention wiU employ, unless otherwise 
indicated, conventional techniques of molecular biology, microbiology, 
recombinant DNA, and immunology, which are within the sldll of the art. Such 
30 techniques are explained fully in the literature. Sme.g., Maniatis, Fitsch & 

Sambrook, "Molecular Cloning; A Laboratory Manual" (1982); "DNA Cloning, 
Volumes I and E" (D.N Glover ed. 1985); "Oligonucleotide Synthesis" (M.J. Gait 
ed, 1984); "Nuddic Acid Hybridization" (B.D. Bbunes & S.J. Higgins eds. 1984); 
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"Transcription and Translation" (B.D. Hames & S.J. Higgins eds. 1984); 
"Animal CeU Cultaie" (R.I. Fiesbney ed. 1986); "Xnunobilized Cells And 
Enzymes" (ERL Press, 1986); B. Peibal, "A Practical Guide To Molecular 
Cloning" (1984); tiie series, "Methods in Enzymology" (Academic Press, Inc.); 
5 "Gene Transfer Vectors For Mammalian Cells" (J.H. Miller and M.P. Calos eds. 
1987, Cold Spring Harbor Laboratory), Meth Enzvmol Vol. 154 and Vol. 155 
(Wu and Grossman, and Wu, eds., respectively), Mayer and Walker, eds. (1987), 
"Immunochemical Methods In Cell And Molecular Biology" (Academic Press, 
London); Scopes, (1987) "Protein Purification: Principles and Practice", Second 
10 Edition (Springer-Verlag, N.Y.); and "Handbook of Experimental Immunology", 
Volumes I-IV (D.M. Weir and C. C. Blackwell eds 1986). AU patents, patent 
applications, and publications mentioned herein, both supra and infra, are hereby 
incorporated herein by reference. 

15 n.A. Truncated HC V Polypeptides 

The useful materials and processes of tiie piesrait invention are 
made possible by the idaitification below of new HCV q)itopes. The knowledge 
of tiiese qpitqpes (or antigenic regions) allows for construction of polype^itides 
containirig truncated HCV sequences which can be used as immunological 

20 reagaits. 

Truncated HCV amino add sequences encoding at least one viral 
qpitope are useful immunological reagents. For example, polypeptides comprising 
such truncated sequences can be used as reagents in an immunoassay. These 
polypeptides also are candidate subunit antigens in compositions for antiserum 

25 production or vaccines. While these truncated sequmces can be produced by 

various known treatments of native viral protein, it is generally preferred to make 
synflietic or recombinant polypeptides comprismg an HCV sequence. Polypeptides 
comprising tiiese truncated HCV sequences can be made up entirely of HCV 
sequences (one or more epitopes, either contiguous or noncontiguous), or HCV 

30 sequences and heterologous sequences in a fusion protein. Useful heterologous 
sequences include sequences that provide for secretion from a recombinant host, 
enhance tiie unmunological reactivity of the HCV epitope(s), or facilitate the 
coupling of the polypq)tide to an immunoassay support or a vaccine carrier. See, 
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e.g., EPO Pub. No. 116,201; U.S. Pat. No. 4,722,840; EPO Pub. No. 259,149; 
U.S. Pat. No. 4,629,783, the disclosures of which are incorporated herdn by 
reference. 

Hie size of polypeptides comprising the truncated HCV sequences 

5 can vary widely, the minimum size being a sequence of sufficient size to provide 
an HCV epitope, while the maximum size is not critical. For convenience, the 
maxunum size usually is not substantially greater than that required to provide the 
desired HCV epitopes and function(s) of the heterologous sequence, if any. 
Typically, the truncated HCV amino acid sequence wiU range from about 5 (or 8) 

0 to about 100 amino acids in length. More typically, however, the HCV sequence 
will be a maximum of about 50 (or 40) amino adds in length, and sometimes a 
maximum of about 20, 25 or 30 amino adds. It is usually desirable to select 
HCV sequences of at least about 8, 10, 12 or 15 amino acids. 

Examples of truncated HCV amino add sequences (octamers) that 

5 are useM as described hetdn are set forth bdow in the examples. It is to be 
nnderstood that these peptides do not necessarily predsdy map one epitope. Non- 
immunogenic portions of the sequence can be defined using conventional 
techniques and ddeted ficom the described sequences. Forflier, additional trun- 
cated HCV amino acid sequences that comprise an qritope or are immunogenic 

[) can be identified as described herein. 

Polypeptide products containing the truncated HCV amino acid 
sequences disclosed below can be prqiared as discrete peptides or incorporated 
into a larger polypeptide, and may find use as described herein. la ptefened 
appKcations, truncated sequences from the El and/or E2 domains have applications 

) in vacdne and therapeutic products. While generally any of the domains can have 
some diagnostic utility, C, NS3, NS4 and NS5 are particularly preferred, with 
combinations of C q)itopes with qjitopes from one or more of the NS3, NS4 or 
NS5 domains bdng particulariy preferred. 

) n.B. Preparation of Polypeptides 

Ihe availabilily of DNA sequences encoding HCV amino acid 
sequences permits the construction of raqpression vectors encoding antigenically ac- 
tive regions of the polypeptide (See, e.g.. Fig. 2). These antigenically active 



wo 93/00365 _21- PCr/US92/05388 

regions may be (feiived from coat or envelope antigens or from core antigens, or 
ftom antigens which are non-stroctuial inchiding, for example, polynucleotide 
binding protdns, polynucleotide polymerase(s), and otiier viial proteins required 
for tile r^lication and/or assembly of the virus paiticle. Fragments ^icoding the 
5 desired polypqitides are derived, for example, ftom viral cDNA clones usmg 
conventional restricticni digestion or by synthetic methods, and are ligated iato 
vectors wluch may, for example, contata portions of fusion sequences such as 
)8-GaIactosidase or superoxide dismutase (SOD), preferably SOD. Methods and 
vectors which are useful for the production of polypeptides which contain fusion 
10 sequences of SOD are described in European Patent Office Publication number 
0196056, published October 1, 1986. Vectors encoding fusion polypeptides of 
SOD and HCV polypeptides, i.e., NANB3.1.1, NANBg,, and ClOO-3, which is 
encoded in a composite of HCV cDNAs, are described in Sections IV.B.l, 
IV.B.2, and IV.B.4, respectively. Any desired portion of flie HCV cDNA 
15 containing an open reading frame (or a synthetic version thereof), can be used to 
express a recombinant polypqrtide, such as a mature or fusion protein. 

Alternatively, a polypeptide contain HCV epitopes can be provided 
by chemical synthesis using standard techniques based on the amino acid sequence 
of the figoKS and the examples. 
20 The DNA encoding flie desired polypeptide, whetiier in fused or 

unfused form, and wh^er or not containing a signal sequence to permit secretion, 
may be ligated into expression vectors suitable for any convenient host. Both 
eukaryotic and prokaryotic host systems are presentiy used in forming recombinant 
polyp€5)tides, and host cell lines is given in EPO Pub. Nos. 318,216. ITie 
25 polypeptide is then isolated from lysed cells or from the culture medium and pur- 
ified to the extent needed for its intended use. Purification may be by techniques 
known in the art, for example, differential extraction, salt fractionation, 
chromatography on ion exchange resins, affinity chromatography, centiifugation, 
and the like. See, for example, Metiiods in Enzymology for a variety of methods 
30 for purifying proteins. Such polypeptides can be used as diagnostics, or those 
which give rise to neutralizing antibodies may be formulated into vaccines. Anti- 
bodies raised against these polypeptides can also be used as diagnostics, or for 
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passive immunotherapy. In addition, as discussed below, antibodies to these 
polypeptides are useful for example in isolating and identifying HCV paiticles. 

The HCV polypeptides may also be isolated from HCV virions and 
tiiincated (if not akeady). Hie virions may be grown in HCV infected cells in 
5 tissue cultute, or in an infected host. 

ILC. Preparation of Antigenic Polvoeotides a nd Conjugation with Carrier 
An antigenic region of a polypeptide is generally relatively 
small-^ically 8 to 10 amino acids or less in length. Fragments of as few as 5 
10 amino adds may characterize an antigenic region. Th^e segments may cor- 
respond to regions of HCV antigen. Accordingly, using the cDNAs of HCV as a 
basis, DNAs encoding short segmaits of HCV polypq)tides can be ejcpressed 
lecombinanfly either as fusion proteins, or as isolated polypeptides. In addition, 
short amino acid sequences can be conveniently obtained by chemical syntiiesis. 
15 In instances wherein tiie syntiiesized polypeptide is correctly configured so as to 
provide the conect epitope, but is too small to be immunogenic, tiie polypeptide 
may be linked to a suitable carrier. 

A number of techniques for obtaining such linbige are known in the 
art, induding the formation of disulfide linkages using N-succinimidyl-3-^- 
20 pyridylfliio)propionate (SPDP) and sucdnimidyl 

4-(N-maleimido-methyl)cycIohexane-l-carboxylate (SMCQ obtained firom Keice 
Company, Eockford, Illinois, (if flie peptide lacks a sulfhydryl group, this can be 
provided by addition of a cysteine residue.) These leagants raeate a disulfide 
linkage between themselves and peptide cysteine residues on one protein and an 
amide linkage through the epsilon-amino on a lysine, or other firee amino group in 
the otiier. A variety of such disulfide/amide-forming agents are known. See, for 
example, Immun Rev (1982) 62:185. Other bifiinctional coupling agents form a 
thioetfaer rather than a disulfide linkage. Many of these thio-etiier-fonning agents 
are commercially available and include reactive esters of 6-maleiniidocaproic acid, 
2-bromoacetic acid, 2-iodoacetic acid, 

4-(N-maleiniido-methyl)cyclohSHcane-l-caTboxylic add, and the like. The carfaoxyl 
groups can be activated by combining them with succininude or l-hydroxyl-2 
nitro-4-sulfonic add, sodium salt. Additional methods of coupling antigens 
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employs the rotavjius/"biiidm£ peptide" system desciibed in EPO Pub. No. 
259,149, the disclosure of which is incoipoiated herein by reference. Hie 
foi^oing list is not meant to be e}diaiistive, and modifications of the named 
compounds can clearly be used. 
5 Any carrier may be used which does not itself induce the production 

of antibodies harmful to the host. Suitable carriers are typically large, slowly 
metabolized macromolecules such as proteins; polysaccharides, such as latex 
iunctionalized Sephaiose*, agarose, cellulose, cellulose beads and the like; 
polymeric amino acids, such as polyglutamic acid, polylysine, and the like; amino 
10 acid copolymers; and inactive virus particles, see, for example. Section II.D. 
Especially useful protein substrates are serum albumins, keyhole limpet 
hemocyanin, immunoglobulin molecules, thyroglobuUn, ovalbumin, tetanus toxoid, 
and other proteins well known to those skilled in the art. 

In addition to full-length viral proteins, polypeptides comprising 

15 

n.D. Preparation of gybod Particle Immunpgens rplTtaming HCy WpHopes 

The immunogenicity of the qutopes of HCV may also be enhanced 
by piepanng tiiem in mammalian or yeast systems fused with or assanbled with 
particle-forming proteins such as, for example, that associated with hepatitis B 

20 surface antigen. See, e.g., US 4,722,840. Constructs wherein tiie HCV q)itope 
is linked directiy to the particle-forming protein coding sequences produce hybrids 
which are immunogenic with respect to the HCV epiiape. In addition, all of flie 
vectors prepared include epitopes specific to HBV, having various degrees of 
immunogenicity, such as, for example, tiie pre-S peptide. Thus, particles 

25 constiucted from particle forming protein which include HCV sequraces are im- 
munogenic witii respect to HCV and HBV. 

Hepatitis surfece antigen (HBSAg) has been shown to be formed and 
assembled into particles iu S. cerevisiae (P. Valenzuela et al. (1982)), as well as 
in, for example, mammalian cells (P. Valenzuela et al. (1984)). The formation of 

30 such particles has been shown to enhance the immunogenicity of the monomer 

subunit. The constructs may also include the immunodominant epitope of HBSAg, 
comprising the 55 amino acids of flie presurface ^re-S) region. Neurath et al. 
(1984). Constructs of tiie pre-S-HKSAg particle expressible in yeast are disclosed 
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in EPO 174,444, published MaaA. 19, 1986; hybrids inchiding heterologous viral 
sequences for yeast expression are disclosed in EPO 175,261, published March 26, 
1966. These constructs may also be expressed in mammalian cells such as 
Chinese hamster ovary (CHO) ceUs using an SV40-dihydrofolate reductase vector 
5 (MicheUe era/. (1984)). 

In addition, portions of the particle-forming protein coding sequence 
may be replaced with codons encoding an HCV epitope. In this replacement, 
regions which are not required to mediate the aggregation of the units to form 
immunogenic particles in yeast or mammals can be deleted, thus eliminating 
10 additional HBV antigenic sites from competition with the HCV epitqpe. 



Vacdnes may be prepared firam one or more ioraiunogenic 
polypq)tides derived firan HCV. These polypq>tides may be expressed in 

15 various host cells ie.g. , bacteria, yeast, insect, or m a mmalian cells), or 
altemativdy may be isolated ficom viral prqarations or made synthetically. 
Single- or nmlti-valent vaccines against HCV may be comprised of one or more 
qntopes ficom one or more structural proteins, and/or one or more epitopes from 
one or more nonstructural protans. These vaccines may be comprised of, for 

20 example, recombinant HCV polypqjtides and/or polypeptides isolated from the 
virions. In particular, vacdnes are contranplated comprising one or more of the 
following HCV protems, or subunit antigens derived therefrom: El, E2, C, NS2, 
NS3, NS4 and fK5. Particularly preferred are vaccines comprising El and/or E2, 
or subunits thereof. 

25 In addition to tiie above, it is also possible to prepare live vaccines 

of attenuated microorganisms which express one or more recombinant HCV 
polypeptides. Suitable att^uated microorganisms are known in the art and 
include, for example, viruses {e.g. , vaccmia virus (see K»wn et al. (1986)), as 
well as bacteria. 

30 The prqjaration of vaccines which contain an immunogenic 

polypqjtide(s) as active ingredients, is known to one skilled in the art. Typically, 
such vacdnes are prepared as injectables, either as liquid solutions or suspensions; 
solid forms suitable for solution in, or suspension in, liquid prior to injection may 
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also be ptepaxed. Hie piepaiadoii may also be annilsified, or liie protein 
encapsulated in ]qx)somes. The active immimogenic ingredients axe often mixed 
with exclpients which are phatmaceutically acceptable and compatible with the 
active ingredient. Suitable exdpients are, for example, wat»*, saline, dextrose, 
5 glycerol, ethanol, or the lite and combinations thraeof . In addition, if desked, the 
vaccine may contain minor amcmnts of auxiliary substances such as wetting or 
emulsifying agents, pH buffering agents, and/or adjuvants which enhance the 
effectiveness of the vaccine. Examples of adjuvants which may be effective 
include but are not limited to: aluminum hydroxide, N-acetyl-muramyl-L-threonyl- 
10 D-isoglutamine (thr-MDP), N-acetyl-nor-muramyl-L-alanyl-D-isoglutamine (CGP 
11637, referred to as nor-MDP), N-acetylmuramyl-L-alanyl-D-isoglutaminyl-L- 
alanine-2-(l '-2'-dipalmitoyl-sn-glycero-3-hydroxyphosphoryloxy)-ethylamine (CGP 
19835A, referred to as MTP-PE), and EIBI, which contains three components 
extracted from bacteria, monophosphoryl lipid A, trehalose dimycolate and cell 
15 wall skeleton (MPL+TDM+CWS) in a 2% squalene/Tween* 80 emulsion. The 
effectivenras of an adjuvant may be determined by measuring the amount of 
antibodies directed against an immunogenic polypeptide containing an HCV 
antigenic sequence resulting from administration of this polypeptide in vaccines 
which are also comprised of the various adjuvants. 
20 The vaccines are conventionally administered parenterally, by injec- 

tion, for example, either subcutaneously or intramuscularly. Additional 
formulations which are suitable for other modes of administration include 
suppositories and, in some cases, oral formulations. For suppositories, traditional 
binders and carriers may include, for example, polyalkyleiie glycols or 
25 trigllycerides; such suppositories may be formed from mixtures containing the ac- 
tive ingredient in tiie range of 0.5 % to 10 % , preferably 1 %-2 % . Oral formu- 
lations include such normally employed excipients as, for example, pharmaceutical 
grades of mannitol, lactose, starch, magnesium stearate, sodium saccharine, 
cellulose, magnesium carbonate, and the like. These compositions take the form 
30 of solutions, suspensions, tablets, piUs, capsules, sustained release formulations or 
powders and contain 10%-95% of active ingredient, preferably 25%-70%. 

The proteins may be formulated into the vaccine as neutral or salt 
forms. Pharmaceutically acceptable salts include tiie add addition salts (formed 
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with free amino groups of the peptide) and wMch are fonned with inorganic adds 
such as, for example, hydrochloric or phosphoric adds, or sach organic acids such 
as acetic, oxalic, tartaric, maleic, and the like. Salts formed with the ftee 
carboxyl groups may also be derived ficom inorganic bases such as, for example, 
5 sodium, potassium, ammonium, calcium, or ferric hydroxides, and such organic 
bases as isopropylamine, trimethylamine, 2-ethylaniino ethanol, histidine, 
procaine, and the like. 



II.F. Dosage and A HmintfitTarinn of Vaccines 

10 The vaccines are administered in a manner compatible with the 

dosage formulation, and in such amount as will be prophylactically and/or 
tOierapeutically effective. The quantity to be administered, which is generally in 
the range of 5 /tg to 250 /tg of antigen per dose, depends on the subject to be 
treated, capacity of the subject's immune system to synthesize antibodies, and the 

15 degree of protection desired. Precise amounts of active ingiedieot required to be 
administered may depend on the judgment of the practitioner and may be peculiar 
to each subject. 

The vaccine may be given in a single dose schedule, or preferably 
in a multq>le dose schedule. A mul^le dose schedule is one in which a primary 

20 course of vacdnation may be with 1-10 separate doses, followed by other doses 
given at subsequent time intervals required -to maintain and or reenforce the 
immune response, for example, at 1-4 months fbr a second dose, and if needed, a 
subsequent dose(s) after sevraal months. The dosage regmien will also, at least in 
part, be determined by the need of flie individual and be dependent vpon the 

25 judgment of the practitioner. 

In addition, the vaccine containing the immunogenic HCV antigen(s) 
may be administered in conjunction with other immunor^Iatory agents, for 
example, immune globulins. 



30 II.G. Preparation of Antibodies Against HCV Epitopes 

Hie immunogenic polypq)tides described herein are used to produce 
antibodies, including polyclonal and monoclonal. If polyclonal antibodies are 
desired, a selected mammal ie.g., mouse, rabbit, goat, honse, etc.} is immunized 
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with an immunogenic polypeptide beazing an HCV qpitqpe(s). Serum £tom the 
immunized animal is collected and treated accorduig to known procedures. If 
senmi containing polyclonal antibodies to an HCV epitope contains antibodies to 
other antigens, Hie polyclonal antibodies can be purified by immumMifimty 
5 chromatography. Techniques for produdng and processing polyclonal antiseia are 
known in the art, see for example, Mayer and Walker (1987).Alteniatively, 
polyclonal antibodies may be isolated fitom a mammal which has been previously 
infected with HCV. 

Monoclonal antibodies directed against HCV epitopes can also be 
10 readily produced by one skilled in the art. The general methodology for making 
monoclonal antibodies by hybridomas is well known. Immortal antibody- 
producing cell lines can be created by cell fusion, and also by other techniques 
such as direct transformation of B lymphocytes with oncogenic DNA, or 
transfection with ]^stein-Barr virus. See, e.g., M. Schreier et al. (1980); 
15 Hammerling et al. (1981); Kennett et al. (1980); ai§Q, U.S. Patait Nos. 

4,341,761; 4,399,121 ; 4,427,783; 4,444,887; 4,466,917; 4,472,500; 4,491,632; 
and 4,493,890. Panels of monodonal antibodies produced against HCV q)itcpes 
can be screened for various properties; i.e. , for isotype, epitope afRnity, etc. 

Antibodies, both monoclonal and polyclonal, which are directed 
20 against HCV epitopes are particularly useful in diagnosis, and those which ate 
neutralizing are use&l in-passive- immunoflieraj^. Monoclonal antil)odies,- in ■ 
particular, may be used to raise anti-^otype antibodies. 

Anti-idiotype antibodies are immunoglobulins which carry an 
"internal image" of the antigen of the infectious agent against which protection is 
25 desired. See, for example, Nisonoff, A.,etaL (1981) and Dreesman et al. 

(1985). Techniques for raising anti-idiotype antibodies are known in the art. See, 
for example, Grzych (1985), MacNamara et al. (1984), and Vytdehaag et al. 
(1985). These anti-idiotype antibodies may also be useful for treatment, 
vaccination and/or diagnosis of NANBH, as well as for an elucidation of the 
30 immunogenic regions of HCV antigens. 
n.H. Immunoassav and Dia^ostic Kits 

Both the polypeptides and the antibodies of the present invention are 
useful in immunoassays to detect presence of HCV antibodies, or the presence of 
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the viius and/or HCV polypeptides (or q>itopes), in, for example, biological 
samples. Design of the immunoassays is subject to a great deal of variation, and 
many fonnats are known in the art. The immunoassay wiU utilize at least one 
viral epitope derived from HCV. In one ranbodiment, tiie immunoassay uses a 
5 combination of viral epitopes derived from HCV. These epitopes may be derived 
from the same or from different viral polypeptides, and may be in separate 
recombinant or natural polypeptides, or together in the same recombinant polypep- 
tides. An immunoassay may use, for example, a monoclonal antibody directed 
towards a viral epitope(s), a combination of monoclonal antibodies directed 
10 towards epitopes of one viral antigen, monoclonal antibodies directed towards 
epitopes of different viral antigens, polyclonal antibodies directed towards the 
same viial antigen, or polyclonal antibodies directed towards different viral 
antigens. Protocols may be based, for example, upon competition, or direct re- 
action, or sandwich type assays. Protocols may also, for example, use solid sup- 
15 ports, or may be by immunopredpitation. Most assays involve the use of labeled 
antibody or polypeptide; the labels may be, £ar example, enzymatic, fluorescent, 
chemiluminescent, radioactive, or dye molecules. Assays which amplify the 
signals ficom the probe are also known; examples of which are assays which utilize 
biotm and avidia, and enzyme-labeled and mediated immunoassays, such as 
20 ELESA assays (described below). 

Typically, an immunoassay for an anti-HCV antibody(s) wiH involve 
selecting and preparing the test sample suspected of containing the antibodies, such 
as a biological sample, then incubating it with an antigenic (i.e., epitope- 
containing) HCV polypeptide(s) under conditions that allow antigen-antibody 
25 complexes to form, and then detecting the formation of such complexes. Suitable 
incubation conditions are well known in the art. The immunoassay may be, 
without limitations, in a heterogenous or in a homogeneous format, and of a 
standard or competitive type. 

In a heterogeneous format, the polypeptide is typically bound to a 
30 solid support to facilitate separation of the sample from the polypeptide after 

incubation. Examples of solid supports that can be used are nitrocellulose (e.g., in 
membrane or microtiter well form), polyvinyl chloride ie.g., in sheets or 
miootiter wells), polystyrene latex (e.g. , in beads or microtiter plates, 
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polyviityHdine fluoride (known as Immulon*''), diazotized p^r, nylon membranes, 
activated beads, and Protein A beads. For example, Dynatech Immulon® 1 or 
Mmulon® 2 mianotiter plates or 0.25 inch polystyraie beads (Precision Plastic 
Ball) can be used in the heterog«Kous format. The solid support containing the 
5 antigradc polypq)tide is t^ically washed after sq)aratmg it from the test sample, 
. and prior to detection of bound antibodies. Both standard and competitive formats 
are known in the art. 

In a homogeneous fiormat, the test sample is incubated with antigen 
in solution. For example, it may be under conditions that will precipitate any 
10 antigen-antibody compleses which are formed. Both standard and competitive 
formats for these assays are known in the art. 

In a standard fonnat, the amount of HCV antibodies forming the 
antibody-antigen complex is dkectly monitored. This may be accomplished by 
determining whether labeled anti-xenogenic (eg. , anti-human) antibwdies which 
15 recognize an epitope on anti-HCV antibodies will bmd due to complex formation. 
In a competitive format, the amount of HCV antibodies in the sample is deduced 
by monitoring the competitive effect on the binding of a known amount of labeled 
antibody (or other competing ligand) in the complex. 

Complexes formed comprismg anti-HCV antibody (or, m the case of 
20 competitive assays, the amount of competuig antibody) are detected by any of a 
number of known techniques, depending on the fonnat. For example, unlabeled 
HCV antibodies in the complex may be d^ected using a conjugate of 
antixenogeneic Ig complexed with a label, ie.g., an enzyme label). 

In immunoassays where HCV polypeptides are the analyte, the test 
25 sample, typically a biological sample, is incubated with anti-HCV antibodies under 
conditions that allow the formation of antigen-antibody complexes. Various 
formats can be employed. For example, a "sandwich assay" may be employed, 
where antibody bound to a solid support is mcubated with the test sample; washed; 
incubated with a second, labeled antibody to the analyte, and the support is washed 
30 again. Analyte is detected by determining if the second antibody is bound to the 
support. In a competitive format, which can be either heterogeneous or 
homogeneous, a test sample is usually incubated with antibody and a labeled. 
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competing antigea is also incubated, dther sequentially or siniultaneously. Thwe 
and other formats are well known in the art. 

Efficient detection systems for HCV infection may include the use 
of panels of epitopes, as described above. The epitopes in the panel may be 
5 constructed into one or multiple polypeptides. The assays for the varying epitopes 
may be sequential or simultaneous. 

The enzyme-linked immunosorbent assay (ELIS A) can be used to 
measure either antigen or antibody concentrations. This method depends upon 
conjugation of an enzyme to either an antigen or an antibody, and uses the bound 

10 enzyme activity as a quantitative label. To measure antibody, the known antigen 
is fixed to a soHd phase (e.g. , a miaxjplate or plastic cup), incubated with test 
serum dilutions, washed, incubated with anti-immunoglobulin labeled with an 
enzyme, and washed again. Enzymes suitable for labeling are known in the ait, 
and inchide, for example, horseradish peroxidase. Enzyme activity bound to flie 

15 solid phase is measured by adding the specific substrate, and detennining product 
formation or substrate utilization colorimetrically. The enzyme activity bound is a 
direct function of the amount of antibody bound. 

To measure antigea, a known specific antibody is fixed to the solid 
phase, the test material containing antigen is added, after an incubation the solid 

20 phase is washed, and a second Mzyme-labded antibody is added. After washing, 
substrate is added, and enzyme activity is estimated colorimetrically, and related to 
antigen concentration. Kits suitable for immunodiagnosis and containing the 
appropriate labeled reagents are constructed by packaging the ai^ropiiate 
materials, including the polypeptides of the invention containing HCV epitopes or 

25 antibodies directed against HCV epitopes in suitable containers, along witii the 
lemaining reagents and materials required for the craiduct of the assay, as well as 
a suitable set of assay instructions. 



m. General Methods 

30 The general techniques used in the practice of tiie present invention 

can be found in, for example, the references cited herein, particularly EPO Pub. 
Nos. 318,216 and 388,232, as well as tiie references in die bibliography, which 
are incorporated herein by reference. 
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IV. Examples 

Described bdow are examples of flie present invention which are 
provided only for illustrative purposes, and not to limit the scope of the present 
invention. In Ught of the present disclosure, numerous embodiments within the 
5 scope of the claims will be appsaesA to those of oidinaiy glHil in the ait. 

IV.A. Epitope Mapping of HCV Genome 

The following example is the result of an epitope mapping 
experiment conducted on the HCVl polyprotem sequence shown in Fig. 1. As 

10 shown in Fig. 3-11, there are heterogeneities among HCV isolates, indicating that 
these amino acid substitutions can be made in the octamers described below. In 
addition to substitutions with amino adds from the corresponding location in other 
HCV isolates, substitotions with synthetic analogs of the particular amino adds or 
conservative substitutions based on cbaige, etc. (pacticulatly when the substitution 

15 does not destroy antibody binding) are within the scope of the invention. 

IV.A.1. Synthesis of Overlapping Pqrtides 

Polyethylene pins arranged on a blodc in an 8 x 12 array (Coseico 
Munetopes, Victoiia, Australia) were prepared by placing the pins in a bath (20% 

20 v/v pipecidme in dimethylfonnamide (DMF)) fior 30 minutes at room temperature. 
The pins were then removed, washed in DMF for S min, then washed in methanol 
ftjur times (2 min/wash). The pins were allowed to air diy for at least 10 min, 
then washed a final time in DMF (5 min). 1-Hydroxybenzotriazole (HOBt, 367 
mg) was dissolved in DMF (80 mL) for use in coupling Fmoc-protected amino 

25 acids: Fmoc-L-Ala-OPQ), Fmoc-L-Cys(Trt)-OPfy, Fmoc-L-Asp(O-fBu)-OPQ), 

Fmoc-L-Glu(0-fBu)-OP^, Fmoc-L-Phe-OP^, Fmoc-Gly-OP^, Fmoc-L-His(Boc)- 
OP^, Fmoc-L-ne-OPQ), Fmoc-L-Lys(Boc)-OP^, Fmoc-L-Leu-OPQ), Fmoc-L-Met- 
OPQ), Fmoc-L-Asn-OPfp, Fmoc-L-Pro-OP^, Fmoc-L-Ghi-OP^, Fmoc-L- 
Arg(Mtr)-OP{ip, Fnioc-L-Ser(r-Bu)-ODhbt, Fmoc-L-Thr(r-Bu)-ODhbt, Fmoc-L-Val- 

30 OP^, and Fmoc-L-Tyr-OR^. 

Hie protected amino adds were placed in microtiter plate wells with 
HOBt, and the pin blr^ placed over tfie plate, immersing the pins in the wdls. 
The assembly was then sealed in a plastic bag and allowed to react at 25'C for 18 
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hours to couple the first amino acids to the pins. The block was then removed, 
and the pms washed wifli DMF (2 min), MeOH (4x2 min), and again with 
DMF (2 min) to clean and deprotect the bound amino acids. Hie procedure was 
repeated for each additional amino acid coupled, until all octamers had been pre- 
5 pared. 

The ftee N-termini were then acetylated to compensate for the free 
amide, as most of the epitopes are not found at the N-terminus and thus would not 
have the associated positive charge. Acetylation was accomplished by filling the 
wells of a microtiter plate with DMF/acetic anhydride/triethylamine (5:2:1 v/v/v) 
10 and allowing the pins to react in the wells for 90 min at 20*'C. The pins were 
then washed with DMF (2 min) and MeOH (4x2 min), and air dried for at least 
10 min. 

The side chain protecting groups were removed by treating the pins 
with trifluoroacetic acid/phenol/dithioethane (95:2.5:2.5, v/v/v) in polypropylene 
15 bags for 4 hours at room temperature. The pins were then washed in dicMoro- 
methane (2x2 min), 595 di-isopnjpylefhylamine/dichlorom«hane (2x5 min), 
dicUoromethane (5 min), and air-dried for at least 10 min. The pins were then 
washed in water (2 min), MeOH (18 hours), dried in vacuo, and stored in sealed 
plastic bags over silica gel. 

20 

IV.A.2. Assay of Peptides 

Octamer-bearing pins prepared as above were first treated by son- 
icating for 30 min in a disruption buffer (1 96 sodium do^cylsulfate, 0.1% 2-mer- 
captoethanol, 0.1 M NaHiP04) at eO^C. The pins were then immersed several 

25 times in water (60°C), followed by boiling MeOH (2 min), and allowed to air dry. 
The pins were then precoated for 1 hour at 25"'C in microtiter wells containing 
200 ixL blocking buffer (1% ovalbumio, 1% BSA, 0.195 Tween®, and 0.05% 
NaNj in PBS), with agitation. The pins were then immersed in microtiter wells 
containing 175 fiL antisera obtained from human patients diagnosed as having 

30 HCV and allowed to incubate at 4°C overnight. The pins were assayed against 
antisera from three individual patients. Specimen #PAA 3663-s ("A") exhibited 
strong reaction to HCV Western blots, HCV competitive ELISA, HCV EUSA to 
clone ClOO-3 (at 1:1000 dihition), and EIBA responses of > 4-1- to ClOO, 5-1-1, 
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and C33c (C22 not done). (Antigen/clone names are per EPO Pub. Nos. 318,216 
and 388,232, as well as those described in the liteiatuie regarding HCV 
immunoassays available from Qrtbo Diagnostics Systems, Inc.) Neat plasma was 
dibited 1:500 in blocking buffer. Specimen ifPAA 33028 ("B") exhibited strong 
5 reaction to HCV Western blots, HCV competitive ELISA, HCV EUSA to clone 
ClOO-3 (at 1:500 dflution), and EIBA responses of >4+ to ClOO, 5-1-1, C33C 
and C22. Polyclonal antisera was partially purified by passage through a protein 
A column, and was used at a dilution of 1:200 in blocking buffer. Specimen 
#PAA S32931 ("C") exhibited moderate reaction to HCV Western blots (3+), 
10 HCV competitive ELTSA, HCV EUSA to clone ClOO-3 (at 1 : 64 dilution), and 
KBA responses of 3+ and 4+ to ClOO and 5-1-1, respectively (C33c and C22 
not done). Polyclonal antisera was partially purified by passage through a protein 
A column, and was used at a dilution of 1:500 in blocking buffer. 

The pins were washed in PBS/Tween* 20 (4 x 10 min) at room 
15 temperature, then incubated in microtiter weUs containing horseradish peroxidase- 
labeled goat anti-Human Ig antisera (175 ftL, 1:2000 dilution in blockii^ buffer 
without NaNj) for 1 hour at 25°C with agitation. The antihuman antisera is spe- 
cific for human Ig liglit and heavy cliains, and reacts with botih IgG and IgM 
classes. The pins were again washed in EBS/Tween» 20 (4 x 10 min) at room 
20 temperature. Substrate solution was prepared by dihiting NaI]^P04 (I M, 200 mL) 
and citric add (1 M, 160 mL) to 2 L with distilled water, adjusting the pH to 4.0. 
A2ano-di-3-ethylbenzt]!iazodinsolfonate (ABTS, 50 mg) and hydrogen peroxide 
(0.3 /iL/mL) was added to 100 mL of buffer immediately prior to use to complete 
tibe substrate sohition. The substrate solution (150 /tL) was added to each weU of 
25 a microtiter plate, and the pins immersed in the wells and incubated at 25 *'C in the 
dark. After color developed, the reactions were halted by removmg the phis, and 
absoifoance of the solutions read at 405 nm. 

The octamers listed below were immunoreactive with anti-HCV 
antisera. Peptides reacting with all three antisera are listed as epitopes, while pep- 
30 tides reacting with only one or two antisera are listed as weak epitopes (indicated 
by " - "). Particularly strong epitopes are labeled with letters rather than numbers 
ie.g., ^AA). 
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2011 


CQRGYKGV 


1808 


LFNILGGW]^28 




2012 


QRGYKGVW 








2013 


RGYKGVWR 


1821 


AAPGAATA - 




2014 


GYKGVWRG 


1851 


ILAGYGAG ^29 




2024 


IMHTRCHC 






:^3i 






1880 


VNLLPABL ~ 














2048 


VGPRICRN 


1908 


PGEGAVQW 13pG 




2049 


GPRICRNfY 


1909 


GEGAVQWM 




2050 


PRICRNYW 


1910 


EGAVQWMN 




2051 


RICRNYWS 


1911 


GAVQWMNR 




2052 


ICRNYWSG 


1912 


AVQWMNRL 




2053 


CRNYWSGT 


1913 


VQWMNRU 




2054 


RNYWSGTE 








2055 


NYWSGTEP 


1925 


RGNHVSH 




2056 


YWSGTEM 








2057 


WSGTEPIN 


1940 


AAARVTAI ^30 








1941 


AARVTAIL 




2071 


TPLPAPNY Bp32 


1942 


ARVTAILS 






1943 


RVTAILSS 




2088 


EEYVIRQV EpV 


1944 


VTAILSSL 




2089 


EYVIRQVG 


1945 


TAILSSLV 




2090 


YVIRQVGD 




ATT CCT \rT 




2091 


VERQVGDF 


1947 


ILSSLVTQ 




2092 


IRQVGDEH 


1948 


LSSLVTQL 




2093 


RQVGDFHY 


1949 


SSLVTQLL 








1950 


SLVTQLLR 




2108 


DNLKCPCQ ~ 


1951 


LVTQLLRR 
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2122 


EIELDGVR^W 




2280 


REAQALPV EpZ 


2123 


lELDGVEL 




2281 


EAQALPVW 


2124 


ELDGYRLH 




2282 


AQALPVWA 


2125 


LDGVRLHR 




2283 


QALPVWAR 


2126 


DGVELHRF 




2284 


ALPVWARP 


2127 


GVRLBRFA 




2285 


LPVWARPD 


2128 


VKLHEFAP 




2286 


PVWAEPDY 


2129 


RLHEFAPP 




2287 


VWAEPDYN 


2130 


LHRFAPPC 




2288 


WARPDYNP 


2131 


HRFAEPCK 




2289 


ARPDYNPP 


2132 


EFAPPCKP 




2290 


EPDYNPPL 


2133 


FAPPCKPL 








2134 


APPCKPLL 




2325 


PEPRKKRT]^35 


2135 


PPCKPLLR 




2326 


PPEKKRTV 


2136 


PCKPLLRE 




2327 


PEKKRTW 


2137 


CKPLLREE 








2138 


KPLLREEV 




2345 


AELASRSE ^36 


2139 


PLLREEVS 




2346 


ELASRSEG 


2140 


LLREEVSF^X 




2347 


LASRSEGS 


2141 


LREEVSFR 




2348 


ASRSEGSS 


2142 


REEVSFRV 




2349 


SRSEGSSS 


2143 


EEVSFRVG 








2144 


EVSFRVGL 




2382 


AESYSSMP ^37 


2145 


VSFRVGLH 








2146 


SFRVGLHE 




2401 


SDGSWSTV 


2147 


FRVGLHEY 


^38 






2148 


RVGLHEYP 














2417 


WCCSMSY 


2165 


EPETOVAV ~ 














2418 


VCCSMSYW 


2187 


GRRLARGS - 




2419 


CCSMSYWI 








2420 


CSMSYWIG 


2226 


UEANLLW^Y 




2421 


SMSYWIGA 


2227 


1 HA rvj 1 .1 .W K 




2422 


■M<8VWTfiAT 


2228 


EANLLWRQ 








2229 


ANLLWRQE 








2230 


NLLWRQEM 








2231 


LLWRQEMG 








2232 


LWRQEMGG 








2244 


VESENKW^33 








2245 


ESENKWI 








2246 


SENKWIL 








2247 


'ENKVYTLD 








2248 


NKWILDS 








2249 


KWTLDSF 








2250 


WILDSED 









2267 EISVPAE[]^34 
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2439 
2440 
2441 
2442 
2443 
2444 
2445 
2446 
2447 
2448 
2449 
2450 
2451 
2452 
2453 
2454 
2455 
2456 

2469 
2470 
2471 
2472 
2473 
2474 
2475 
2476 

2495 



QKLPINAL l^BB 

KLPINALS 

LPINALSN 

PINALSNS 

INALSNSL 

NALSNSLL 

ALSNSLLR 

LSNSLLRH 

SNSLLRHH 

NSLLRHHN 

SLLRHHNL 

LLRHHNLV 

LRHHNLVY 

EHHNLVYS 

HHNLVYST 

HNLVYSH 

NLVYSHS 

LVYSHSR 

QKJCVTFDR ^39 

KKVTFDRL 

KVTFDRLQ 

VTFDRLQV 

TFDELQVL 

FDRLQVID 

DKLQVUDS 

ELQVLDSH 

ASKVKANL ~ 



2602 LPLAVMGS 



PLAVMGSS 
LAVMGSSY 
AVMGSSYG 
VMGSSYGE 
MGSSYGEQ 
GSSYGEQR 
SSYGEQRV 
SYGEQRVE 
YGEQRVEE 



Bp41 



2603 
2604 
2605 
2606 
2607 
2608 
2609 
2610 
2611 
2612 



2632 KTPMGFSY 

2633 TPMGFSYD 

2634 PMGFSYDT 

2635 MGFSYDTR 

2636 GFSYDTRC 

2637 FSYDraCE 

2638 SYDTRCED 



2660 
2676 



YQCCDLDP - 
LTERLYVG 



2677 TERLYVGG 

2678 ERLYVGGP 

2679 RLYVGGPL 



2533 
2534 

2573 
2574 
2575 
2576 
2577 
2578 



RKAVTEHN^CC 
KAVIHINS 

GRKPARU^O 

RKPARUV 

KPARUVF 

PARIIVFP 

ARUVFED 

RUVEPDL 



BpFF 



2688 NSRGENCG 

2689 SRGENCGY 

2690 RGENCGYR 

2691 GENCGYRR 

2692 ENCGYRRC 

2693 NCGYRRCR 

2707 TSCGNTLI Ep42 

2721 AACRAAGL ~ 



2757 AFTEAMTR 

^3 

2758 FTEAMTRY 

2759 TEAMTRYS 

2760 EAMTRYSA 

2761 AMTRYSAP 

2762 MTRYSAPP 



wo 93/00365 



-40- 



PCrA3S92/05388 



^5 



^GG 



2779 


DLELUSC lBp44 


2878 


DIJ>PnQR^7 


2780 


LELHSCS 


2879 


LPPEQRL 






2880 


PEHQELH 


2794 


HDGAGKRV 


2881 


PHQELBG 






2882 


nQSLHGL 


2795 


DGAGKRVY 


2883 


IQRLEIGLS 


2796 


GAGKRVYY 


2884 


QRLHGLSA 


2797 


AGKRVYYL 


2885 


SIH6LSAP 


2798 


GKRVYYLT 


2886 


LSGLSAFS 


2799 


KRVYYLTR 


2887 


HGLSAFSL 


2800 


RVYYLTRD 


2888 


GLSAFSLH 


2801 


VYYLTRDP 


2889 


LSAFSLHS 


2802 


YYLTRDPT 


2890 


SAFSmSY 






2891 


AFSmSYS 


2817 


WETARHTP 


2892 


FSLHSYSP 






2893 


SLHSYSPG 


2818 


ETABHTPV 


2894 


mSYSPGE 


2819 


TARHTPVN 


2895 


HSYSPGEI 


2820 


ARHTPVNS 






2821 


RHTPVNSW 






2822 


HTPVNSWL 






2823 


TPVNSWLG 






2824 


PVNSWLGN 






2825 


VNSWLGNI 






2826 


NSWLGNU 






2827 


SWLGNIIM 






2828 


WLGNQME 






2829 


LGNQMEA 






2830 


GNUMEAP 






2831 


NUMEAPT 






2832 


IIMEAPTL 






2833 


IMEAPTLW 






2834 


MEAFTLWA 






2835 


EAPTLWAR 






2836 


APTLWAEM 






2837 


VTLWASMl 






2838 


TLWAEMIL 






2839 


LWARMEM 






2840 


WARMILMT 






2841 


ASMUMIH 






2842 


PMTT.MTTTF 






2843 


MELMIHFE 






2863 


LDCEIYGA^>46 






2864 


DCEIYGAC 






2865 


CEiyGACY 






2866 


EIYGACYS 






2867 


lYGACYSI 
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2912 


LGVPPLRA Bp4S 


2913 


GVPPLRAW 


2914 


VPPLRAWR 


2938 


AAICGKYL 13p49 


2939 


AICGKYLE 


2940 


ICGKYLEN 


2966 


DLSGWETA 



10 ^HH 

2984 VSHAEPRW ^ 

15 IV.B. Differential Amy 

The following assay was perfomed to distinguish early antigens 
from later antigens. Antibodies to the early antigens may be detected, and used to 
diagnose HCV infection more quickly. 

Serial bleeds were obtained from a human patient presenting with 

20 elevated ALT, but negative for anti-ClOO-3 antibody. The five bleeds obtained 
prior to complete seroconversion (CI 00-3 positive) were pooled and used in the 
assay at a dilution of 1:2000. The assay was conducted as described in Section 
rV.A. above. However, one duplicate set of pins was incubated with horseradish 
peroxidase-labeled goat anti-Human IgG specific antisera, while the other set was 

25 incubated with horseradish peroxidase-labeled goat anti-Human IgM specific anti- 
sera. Epitopes immunoreactive with IgM antibodies are early epitopes. 

The results indicated that most early epitopes are found in the 
region extending from about amino acid 480 to about amino add 650. Paiticularly 
strong IgM epitopes were octamers beginning with amino acid nos. 506, 510, 523, 

30 553, 562, 580, and the region from 590 to 620. Assays which employ antigens 
bearing epitqpes from this region win permit diagnosis of HCV infection at an 
early point than assays enaploying other antigens. 

We have additionally tested serial plasma specimens taken from five 
patients with open heart post-transfiision NANB hepatitis, with studies followed 

35 for 3-12 yeare. Initial bleed dates were less than one weelc apart. Each specimen 
was tested for IgG and IgM by EIA against one core antigen (C22), two envelope 
antigens (El and E2), and three nonstnictural region antigens (C33c, ClOO, and 
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NS5). We found that the response to C22 and C33c preceeded the IgG 
response for those antigens. NS-5 also induced an IgM response, but this 
response did not preceed the IgG response for that antigen. Thus, one can prepare 
assays capable of determining very early stages of infection by utilizing epitopes 
5 derived ftom the C22 and C33c regions and assaying for IgM binding. Antibodies 
to the C33c region persisted for the longest period of time, suggesting that 
diagnostic assays directed toward C33c should be the most reliabte. 

IV.C. Sequence Variations in HCV Isolates from Different Individuals 
10 Isolates of HCV which contain sequences which deviate from 

CDC/HCVl were idratified in human individuals, some of whom were 
serologically positive for anti-ClOO-3 antibodies (EClO was antibody negative). 
Identification of these new isolates was accomplished by cloning and sequencing 
segments of the HCV genome which had been amplified by the FCR technique 
15 using CDC/HCl sequences. The method utilizes primers and probes based upon 
the HCV cDNA sequences described hei^. The first step in the method is the 
synthesis of a cDNA to either the HCV genome, or its replicadve uitetmediate, 
using reverse tianscriptasB. After synthesis of the HCV cDNA, and prior to 
amplification, the RNA in the sample is degraded by techniques known in the art. 
20 A designated segment of the HCV cDNA is then amplified by the use of the 

appropriate primers. The amplified sequences are cloned, and clones containing 
the amplified sequences are detected by a probe which is complementary to a 
sequence lying between the primers, but which does not overij^) the primers. 

25 IV.C.l. HCV Isolates Isolated from Humans in the U.S. 

Blood samples which were used as a source of HCV virions were 
obtained fix)m the American Red Cross in Charlotte, North Carolina, and from the 
Community Blood Center of Kansas, Kansas City, Missouri. The samples were 
sraeened for antibodies to the HCV ClOO-3 antigen usmg an ELISA assay and 

30 subjected to siqiplemental Western blot analysis usmg a polyclonal goat anti-human 
HRP to measure anti-HCV antibodies. Two samples, j?23 and #27, from the 
American Red Cross and from the Communis Blood Center of Kansas, 
respectively, were determined to be HCV positive by these assays. 
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Viial paiticles present in the seium of these samples were isolated 
by ultracentriiiigation under the conditions described by Bradley et al. (1985). 
SNA was extracted from the particles by digestion with proteinase K and SDS at 
final concentrations of 10 /tg/mL proteinase K, and 0.1% SDS; digestion was for 
5 1 hour at 2TC. Viral RNA was further purified by extraction with chloroform- 
phenol. 

HCV RNA in the preparation of RNA was reverse transcribed into 
cDNA. After both strands of the cDNA were synthesized, the resulting cDNA 
was then amplified by the PGR method. The HCV cDNAs in three clones derived 
10 from each HCV isolate, were subjected to sequence analysis. Analysis was 
essentially by the method described in Chen and Seeburg (1985). 

Consensus sequences of the clones derived from HCV in samples 23 
and 27 are shown in Fig. 3 and Fig. 4, respectively. The variable sequences are 
also shown in these figures, as are the amino acids encoded in the consensus 
15 sequences. 

Figures 5 and 6 show comparisons of the aligned positive strand 
nucleotide sequences (Fig. 5) and putative amino acid sequences (Fig. 6) of 
samples 23, 27, and HCVl. The amino acid sequence of HCVl in Fig. 6 
r^resents amino add numbers 129-467 of the HCV polyprotein encoded by the 

20 large ORF in the HCV genomic RNA. An examination of Figs. 5 and 6 show that 
there are variations in the sequences of the three isolated clones. The sequence 
variations at the nucleotide level and the amino acid level are summarized in the 
table immediately below. Li the table, the polypeptides designated S and NSl 
rqjTBsent amino acid numbers 130 to -380, and 380 to -470, respectively, as 

25 those domains were previously known. The numbering is from the putative 

initiator methionine. The tenninology S and NSl is based upon the positioning of 
the sequences encoding the polypeptides using the Flavivirus model. As discussed 
above, however, recent evidence suggests that there is not total correlation 
between HCV and the Flaviviruses with regard to viral polypeptide domains, 

30 particularly in the putative E/NSl domains. Indeed, HCV polypeptides and their 
coding domains may exhibit substantial deviation from the Flavivirus model. 
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Nucleotide EncodingAmino Add Bacoded 
overaU S NSl overaU S NSl 



5 % 


% 


% 




% 


% 


HCV1/HCV23 93 


95 


91 


92 


95 


87 


HCV1/HCV27 89 


93 


84 


89 


95 


82 


B-C^liraCNll 89 


93 


85 


90 


93 


84 



Although there are variations in the newly isolated HCV sequences, 
the cloned sequences from samples 23 and 27 (caUed HCV23 and HCV27) each 
contain 1019 nucleotides, indicating a lack of deletion and addition mutants in this 
region in the selected clones. Hie sequences in Figs. 5 and 6 also show that the 
isolated sequences are not rearranged in this region. 

A comparison of the consensus sequences for HCVl and for the 
other isolates of HCV is summarized in the Table, 5iq>ra. The sequence variations 
between the chimpanzee isolate HCVl, and the HCVs isolated from humans are 
about the same as that seen between the HCVs of human origin. 

It is of interest that the sequence variations in two of the putative 
domains is not unifiinn. The sequence in a putative S region appears to be 
relatively constant, and randomly scattered throughout the region. Ixl contrast, a 
putative NSl region has a higher idegree of variabilitjr than the overall sequence, 
and the variation appears to be in a hypervariable pocket of about 28 amino adds 
which is located about 70 amino acids downstream ftom the putative N-tenninus 
of the putative polyprotein. 

Although it may be argued that the detected variations were 
introduced during the amplification process, it is unlikely that all of the variations 
are ftom this result. It has been estimated that Taq polymerase introduces errors 
into a sequence at approximately one base per 10 kilobases of DNA template per 
cycle (SaiM et al. (1988)). Based upon this estimate, up to 7 errors may have 
been introduced during the PGR amplification of the 1019 bp DNA fiagment. 
However, the three subclones of HCV-23 and HCV-27 yielded 29 and 14 base 
variations, respectively. The following suggest that these variations are naturally 



wo 93/00365 



-45- 



PCr/US92/05388 



occurring. About 60% of the base changes are sflent mutations whfch do not 
change the amino add sequence. Variations introduced by the Taq polymerase 
during PGR amplification would be expected to occur randomly; however, the 
results show that the variant sequences are clustered in at least one specific region. 

5 

IV.C.2. HCV Isolates fiom Humans in Italv and in the TT.S. 

Segments of HCV RNA present in different isolates were amplified 
by tbe HCV/cPCR method. These segments span a region of -0.6 Kb to -1.6 
Kb downstream from the methionine encoding start codon of the putative HCV 

10 polyprotein. The isolates are from biological specimens obtained from HCV 
infected individuals. More specifically, isolate HCT #18 is from human plasma 
from an individual in the U.S.A., ECl and EClO are from a liver biopsy of an 
Italian patient, and Th is from a peripheral blood mononucleocyte fraction of an 
American patient. Comparable segments of HCV RNA have been isolated from a 

15 chimpanzee. 

SNA was extracted from the human plasma specimens using 
phenol:CHCl3:isoamyl alcohol extraction. Ether 0.1 mL or 0.01 mL of plasma 
was diluted to a final volume of 1.0 mL, with a TENB/proteinase K/SDS solution 
(0.05 M Tris-HCL, pH 8.0, 0.001 M EDTA, 0.1 M NaCl, 1 mg/mL Proteinase 

20 K, and 0.5 % SDS) containing 10 to 40 /xg/mL polyadenylic acid, and incubated at 
Sl'C for 60 minutes. After this proteinase K digestion, the resultant plasma 
ftactions were dq)rofeinized by extraction with TB (50 mM "nis-HCl, pH 8.0, 1 
mM EDTA) saturated phenol, pH 6.5. The phenol phase was separated by 
centrifugation, and was reextracted with TENS contauiing 0.1% SDS. The 

25 resultmg aqueous phases from each extraction were pooled, and extracted twice 
with an equal volume of phenol/chlorofoim/isoamyl alcohol [1:1(99:1)], and then 
twice with an equal volume of a 99:1 mixture of chloroform/isoamyl alcohol. 
Following phase separation by centrifugation, the aqueous phase was brought to a 
final concentration of 0.2 M Na acetate, and the nucleic acids were precipitated by 

30 the addition of two volumes of ethanol. The precipitated nucleic acids were 

recovered by ultracentrifiigation ia a SW 41 rotor at 38 K, for 60 minutes at 4''C 
or in a microfiige for 10 minutes at 10 K, 4''C. 
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RNA extracted from the liver biopsy was provided by Dr. F. 
Bonino, Ospedale Maggiore di S. Giovanni Battista, Torino, Italy. The 
mononucleocyte fraction was obtained by sedimentation of the individual's aliquot 
of blood through Ficoll-Paque® (Pharmacia Corp), usmg the manufsictuier's dir- 
5 ections. Total RNA was extracted from the fraction using the guamdinium thio- 
cyanate procedure described in Choo et al (1989). 

Synthesis of HCV cDNA from the samples was accomplished using 
reverse transcriptase. Following ethanol precipitation, the precipitated RNTA or 
nucleic acid fraction was dried, and resuspended in DEPC treated distilled water. 

10 Secondary structures in the nucleic acids were disrupted by heating at 65 "C for 10 
minutes, and the samples were immediately cooled on ice. cDNA was synthesized 
using 1 to 3 /tg of total ENA from liver, or firom nucleic acids (or ENA) extracted 
from 10 to 100 ftL of plasma. The synthesis utilized reverse transcriptase, and 
was in a 25 ^ reaction, using the protocol specified by the manu&cturer, BEL. 

15 AH reaction mixtures for cDNA synthesis contained 23 units of the KNAase 
inhibitor, Rnasin* (Fishet/Promega). Following cDNA synthesis, the reaction 
mixtures were diluted with water, boiled for 10 minutes, and quiddy chilled on 
ice. 

Each set of samples was subjected to two rounds of PGR 
20 amplification. The primers for the reactions were selected to amplify regions 

designated "EavL" and EnvR". The "EnvL" region encompasses nucleotides 669- 
1243, and putative amino acids 117 to 308; the "EnvR" region encompasses 
nucleotides 1215-1629, and encodes putative amino adds 300-408 (the putative 
amino acids are numbered starting from the putative mediionine initiation codon). 
25 The PCR reactions were performed essaitially according to the 

manufacturer's directions (Cetus-Perkin-Ehner), except for the addition of 1 fig of 
RNase A. The reactions were carried out in a final volume of 100 , The PCR 
was performed for 30 cycles, utilizing a regimen of 94°C (1 min), 37°C (2 min), 
and 72°C (3 min), with a 7 minute extension at 72''C for the last cycle. The 
30 samples were then extracted with phenohCHC^, ethanol precipitated two times, 
responded in 10 mM Tris HCl, pH 8.0, and concentrated using Centricon-30 
(Amicon) filtration. This procedure efficiently removes oligonucleotides less than 



wo 93/00365 ^7. PCr/US92/05388 

30 nucleotides in size; thus, tiie piimeis from tiie first round of PCR amplification 
areiemoved. 

Hie Centiicon-30 conc^trated samples were then subjected to a 
second round of VCR amplification. Amplification by PCR was for 35 cycles 
5 utilizing a regimen of 94'C (1 min), &)°C (1 min), and 72"'C (2 min), with a 7 
minute extension at 72 "C for the last cycle. The samples were then extracted with 
phenohCHas, precipitated two times, and digested with EcoRI. The PCR 
reaction products were analyzed by separation of the products by electrophoresis 
on 6% polyaciylamide gels. DNA of approximately the estimated size of the 
10 expected PCR product was electroeluted from the gels, and subcloned into eiUier a 
pGEM-4 plasmid vector or into Xgtll. The expected product sizes for the EnvL 
and EnvR after the first round of amplification are 615 bp and 683 bp, 
respectively; after the second round of amplification the expected product sizes for 
EnvL and EnvR are 414 bp and 575 bp, reqjectively. The plasmids containing the 
15 amplified products were used to transform host cells; the pGEM-4 plasmid was 
used to traosform DH5-alpha, and Xgtll was used to transform C600 delta-HFL. 
Clones of the transformed ceUs which either hybridized to the appropriate HCV 
probes, or those which had inserts of the correct size were selected. The inserts 
were then cloned in M13 and sequenced. The probes for all of the HCV/cPCR 
20 products consisted of ^^P labeled sections of HCV cDNA which had been piepared 
by PCR amplification. 

Sequence information on variants in the EnvL region was obtained 
from 3 dones ftom HCT #18, 2 clones from TH, 3 clones from ECl, and from 
the HCVl clones. A comparison of the composite nucleotide sequence of each 
25 isolate derived from these clones is shown in Fig. 7. M the figure, each sequence 
is shown 5' to 3' for the sense strand for the EnvL region, and the sequences have 
been aligned. The vertical lines and capital letters indicate sequence homology, 
the absence of a line and an uncapitaHzed letter indicates a lack of homology. The 
sequences shown in the lines are as follows: line 1, Thorn; line 2, ECl; line 3, 
30 HCT #18; line 4, HCVl. 

Sequence information on variants in the EnvR region was obtained 
from two clones of EClO, and from HCVl clones. The two EClO clones difi'ered 
by only one nucleotide. A comparison of the nucleotide sequences of EC10{clone 
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2) and a composite of the HCVl sequences is shown in Fig. 8; each sequence is 
shown 5' to 3' for the sense strand of the EnvR region, and the sequences have 
been aligned. The double dots between the sequences mdicate sequence 
homology. 

5 A comparison of the amino acid sequences encoded in the EnvL 

(amino acids #117-308) and EnvR region (amino acids #300^38) for each of the 
isolates is shown in Fig. 9 and Fig. 10, respectively. Included in the Figures are 
sequences for the isolates JH23 and JH27, described above. Also indicated are 
sequences from a Japanese isolate; these sequences were provided by Dr. T. 

10 Miyamura, Japan. In the figures, the amino acid sequence for the region is given 
in its entirety for HCVl, and the non-homologous ammo acids in the various 
isolates are indicated. 

As seen in Fig. 9, In the EnvL region there is overall about a 93 % 
homology between HCVl and the other isolates. HCT18, Hi, and ECl have 

15 about a 97% homology with HCVl; JH23 and JH27 have about 96% and about 
95% homology, respectively, with HCVl. Fig. 10 shows flat the homologies in 
the EnvR region are significantly less than in the EnvL region; moreover, one 
subregion appears to be hypervariable (/.c, from amino acid 383-405). lliis data 
is summarized in the Table immediately below. 

20 Table: Homology of EnvR Region 

Isolate Percent HtMnology wifli HCVl 

AA33Q-AA438 AA383-AA405 

]H23(U.S.) 83 57 

JH27(U.S.) 80 39 

25 Japanese 73 48 

EClO ataly) 84 48 

VI. Industrial Applicability 

The epitopes identified herein can be used to make polypeptide 
30 products as described above for applications such as the screening of blood for 
HCV infection, clinical HCV diagnosis, the generation of antibodies, and 
preparation of medicaments. Other applications are described above, and still 
others wiU be readily apparent to those of ordinary sidll. 
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WHAT Ig CLAIMED; 

1. A polypeptide comprising a truncated HCV sequence containing 
an HCV epitope of the formula 
5 aa,-aay 
wherein aa denotes an amino add; 
X and y are integers such that y-x S 6; 

aa,-aay indicates a portion of the amino acid sequence of Figure 1 ; and 
X is selected from the group consisting of 23-34, 36, 66-79, 81-94, 96- 

10 98, 101-103, 186-189, 191, 206, 223, 232, 256, 286, 297-299, 321, 347, 357, 
413, 414, 432, 465-471, 480-484, 501, 502, 521, 540-549, 579, 594-599, 601- 
613, 641, 662-665, 685, 705, 706, 729, 782-789, 801, 851-855, 893, 916, 928, 
946, 952-954, 1026, 1072, 1109, 1112-1117, 1218, 1240, 1280-1285, 1322, 1338, 
1371, 1384, 1410, 1411, 1454, 1492, 1493, 1532-1535, 1560, 1561, 1566-1568, 

15 1571-1577, 1601-1607, 1615-1620, 1655, 1695, 1710-1712, 1728, 1729, 1758- 
1762, 1781, 1808, 1821, 1851, 1880, 1908-1913, 1925, 1940-1948, 1951, 1966- 
1969, 1999, 2001-2004, 2006-2014, 2024, 2048-2053, 2055-2057, 2071, 2088- 
2093, 2108, 2122-2148, 2165, 2187, 2226-2232, 2244-2249, 2267, 2281-2286, 
2288, 2289, 2325-2327, 2346, 2347, 2349, 2382, 2401, 2417-2422, 2439-2444, 

20 2446-2456, 2469, 2471-2476, 2495, 2533, 2534, 2573-2578, 2602-2604, 2606- 
2612, 2632-2638, 2660, 2676-2679, 2688-2693, 2707, 2721, 2757-2762, 2779, 
2794, 2795, 2797-2799, 2801, 2802, 2817-2843, 2863-2867, 2878-2884, 2886- 
2895. 

25 2. The polypeptide of claim 1 which is about 100 amino acids or less 

in length. 

3. Hie polypeptide of daim 2 wherein y-x ^ 50. 
30 4. Hie polypeptide of claim 2 wherein y-x £ 20. 



5. The polypeptide of claim 2 wherein y-x < 10. 
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6. The polypeptide of claim 2 whetdn x is selected ftom the group 
consisting of 506, 510, 523, 553, 562, 580, and 590-620. 

7. A polypeptide of about 100 amino acids or less comprising an 
5 HCV qpitepe of the formula 

aa^-aay 

wherein aa denotes an amino acid; 

X and y are integers such that y-x S 6; 

aa^-aay indicates a portion of the amino acid sequence of Figure 1; and 
10 X is selected from the group consisting of 35 (where y is less than 45), 

80 (where y is less than 90), 95 (where y is less than 110), 99 (where y is less 
than 120), 100 (where y is less than 150), 190 (where y is less than 210), 500 
(where y is less than 550), 600 (where y is less than 625), 1260 (where y is less 
than 1280), 1569 (where y is less than 1931), 1570 (where y is les* than 1590), 
15 1694 (where y is less than 1735), 1949 (where y is less than 2124), 1950 (where y 
is less than 1985), 2000 (where y is less than 2050), 2005 (where y is less than 
2025), 2054 (where y is less than 2223), 2250 (where y is less than 2330), 2287 
(where y is less than 2385), 2290 (where y is less than 2310), 2345 (where y is 
less than 2375), 2348 (where y is less than 2464), 2445 (where y is less than 
20 2475), 2470 (where y is less than 2490), 2605 (where y is less than 2620), 2780 
(where y is less than 2830), 2796 (where y is less than 2886), 2800 (where y is 
less than 2850), and 2885 (where y is less than 2905). 

8. An immunoassay reagent compnsing a polypeptide according to 

25 any of claims 1-7. 

9. The immunoassay reagent of claim 8 wherein y-x £ 50. 

10. The immunoassay reagent of claim 8 wherein x is selected ftom 
30 the group consisting of 506, 510, 523, 553, 562, 580, and 590-620. 
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11. A m^od for d^ec^g the pissence of antibodies inununoisactive 
with Hepatitis C vims (HCV) proteins in a sample, said method comprising: 

contacting an immobilized hnmunoassay reagent according to daim 8 
with said sample; and 
5 detecting antibodies bound to said reag^. 

12. A method for inducing an immunological response in a subject 
against HCV, said method comprising: 

10 administering to said subject an effective amount of a polypeptide 

accoding to any one of claims 1-7. 

13. A composition for inducing an immunological response in a 
subject against HCV, said composition comprising an effective amount of a 

15 polypqptide accoding to any one of claims 1-7. 

14. A monoclonal or polyclonal antibody composition wherein said 
antibodies bind the HCV epitope of a polypqptide accoiding to claim 1 . 

20 15. A method of making a polypqptide according to any of claims 1-7 

wherein said polypeptide is prepared by recombinant e^gpression or chemical 
synthesis. 
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FIG, I - 1 

R T 

HSTNPKPQKKNKRNTNRRPQDVKFPGGGQIVGGVYLLPRRGPRLGVRATR 

KTSERSOPRGRROPIPKARRPEGRTWAQPGYPWPLYGNEGCGWAGWLLSP-100 

RGSRPSWGPTDPRRRSRNLGKVIDTLTCGFADLMGYIPLVGAPLGGAARA 

T 

LAHGN^VLEDGVNYATGNLPGCSFSIFLIALLSCLTVPASAYQVRNSTGL-200 
YHVTNDCPNSSIVYBAADAILHTPGCVPCVREGNASRCWVAMTPTVATRD 
GKLPATQLRRHIDLLVGSATLCSALYVGDLCGSVFLVGQLFTFSPRRHWT-3 0 0 

V 

TOGCNCSIYPGHITGHRMAWDMMMNWSPTTALVMAQLLRIPOAILDMIAG 

AHWGVIAGIAYFSMVGNWAKVLVVLLLFAGVDAETHVTGGSAGHTVSGFV-400 

SLLAPGAKQNVQLINTNGSWHLNSTALIJCNDSLNTGWLAGLFYHHKFNSS 

G CPERLAS CRPLTDFDOGWGP IS YANGSGPDQRPYCWHYPPKPCG IVPAK-5 0 0 

SVCGPVYCFTPSPVWGTTDRSGAPTYSWGBNDTDVFVLNNTRPPLGNWF 

GCTWMNSTGFTKVCGAPPCVIGGAGNNTLHCPTDCFRKHPDATYSRCGSG- 6 0 0 

I 

PWLTPRCLVDYPYRLWHYPCTINYTIFKIRMYVGGVEHRLEAACNVrrRGE 
RCDLEDRDRS ELS PLLLTTTOWQVLPCS FTTLPALSTGLI HLHQN IVDVQ- 7 0 0 
YLYGVGSS I AS WAI KWEYVVLLFIJXADARVCSCLWMMLLI SQAEAALEN 
LVILNAASLAGTHGLVSFLVFFCFAVraJKGKWVPGAVYTFYGMWPLLLLL-800 
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(N) 

LALPORAYALDTEVAASCGGWLVGLMALTLSPYYKRYISWCLWWLQYFL 
TRVEAQLHVWI P PLNVRGGRDAVILLMCAVHPTLVFDITKLLLAVFG PLV7- 900 
I LQASLLKVPy FVRVQGLLRFCALARKMIGGHYVQMVI IKLGALTGTYVY 
NH LT P LRDV7AH NG LRDLAVAVEP WFSQMETKLI TWG ADTAACGD 1 1 NGL- 1 0 0 0 
PVS ARRGREILLG PADGMVSKGWRLLAPITAYAQQTRGIZX3CI ITSLTGR 
DKNQVEGEVQI VS TAAOTFLATCINGVCWTVYHGAGTRTIASPKG PVIQM- 1 1 0 0 

S T 

YT^^VDQDLVGWPAPOGSRSLTPCTCGSSDLYLVTRHADVIPVRRRGDSRG 

S LLS P RP I S YLKG S SGG P LLC P AG H AVG I FRAAVCTRG VAKAVDF I P VEN - 1 2 0 0 

LETTMRSPVFTDNSSPPWPQSFQVAHLHAPTGSGKSTKVPAAYAAQGYK 

L 

VLVLNPSVAATLGFGAYHSKAHGIDPNIRTGVRTITTGSPITYSTYGKFL~1300 



ADGGCSGGAYDI I ICDECHSTDATSILGIGTVLDQAETAGARLWLATAT 

PPGSVTVPHPNIEEVALSTTGEIPFYGKAIPLBVIKG6RHLIFCHSKKKC-1400 

DELAAKLVALGINAVAYYRGLDVSVIPTSGDWWATDALMTGYTGDFDS 

(S) 

VIDCNTCVTQTVDFSLDPTFTIETITLPQDAVSRTQRRGRTGRGKPGIYR-1500 
FVAPGERPSGMFDSSVLCECYDAGCAWYBLTPAETTVRLRAYMNTPGLPV 
CQDHLEFWEGVFTGLTHIDAHFLSQTKQSGENLPYLVAYQATVCARAQAP- 1 6 GO 
PPSV7DQMWKCLIRLKPTLHGPTPLLYRLGAVQNEITLTHPVTKYIMTCMS 
ADLEWTSTWVLVGGVLAALAAYCLSTGCWIVGRWLSGKPAI I PDREV- 1700 
LYREFDEKEECSQHLPYIEOGMMLAEQFKQKALGUJQTASRQAEVIAPAV 
OTNWQKLETFWAKHMWNPISG IQYLAGLSTLPGNPAI ASUiAFTAAYTS P- 1 800 
LTTSQTLLFNILGGWVAAQLAAPGAATAFVGAGLAGAAIGSVGLGKVLID 

«3) 

ILAGYGAGVAGALVAPKIMSGBVPSTEDLVNLLPAILSPGALWGWCAA-1900 

(HC) 

ILRRHVGPGEGAVQWMNRLIAFASRGNHVSPTHYVPESDAAARVTAILSS 



FIG. I -2 
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LTVTOLLRRLHOWISSECTTPCSGSWLRDIWDWICEVLSDFKTWLKAKLM~2000 
(V) 

PQLPG I P FYS CQRG YKGVWRGDG IMHTRCHCG AE ITG HVKNGTMR IVG PR 

TCRNMWSGTFPINAYTTGPCTPLPAPNYTFALWRVSAEEYYEIRQVGDFH-2100 

YVTGMTTDNLKCPCQVPSPEFFTELDGVRLHRFAPPCKPLLREEVSFRVG 

LHEYPVGSQLPCEPEPDVAVLTSMLTDPSHITAEAAGRRLARGSPPSVAS-2200 

SSASOLSAPSLKATCTANHDSPDAELIEANLLWRQEMGGNITRVESENKV 

VILDSFDPLVAEEDEREISVPAEILRKSRRFAQALPVWARPDYNPPLVET-2300 

(S) 

V7KKPDYEPPWHGCPLPPPKSPPVPPPRKKRTWLTESTLSTALAELATR 

(FA) 

SFGSSSTSGITGDNTTTSSEPAPSGCPPDSDAESYSSMPPLEGEPGDPDL-24 00 
SDGSV7STVS S EAN AEDWCCSMS YSWTG ALVTPCAAEEQKLP I NALSNSL 
LRHHNL\rYSTTSRS ACQRQKKVTFDRI^NTl^SHYQDVLKBVKAAASKVKA- 2500 

(F) 

NLLSVEEACSLTPPHSAKSKFGYGAKDVRCHARKAVTHINSVWKDLLEDN 

VTP IDTTIMAKNEVFCVQPEK6GRKPARLIVFPDLGVRYCEKMALYDVVT-2 6 00 

KLPLAVMGSSYGFQYSPGQRVEFLVQAWKSKKTPMGFSYDTRCFDSTVTE 

(G) 

SDIRTEEAIYQGCDLDPOARVAIKSLTERLYVGGPLTNSRGENCGYRRCR-2700 
ASGVLTTSCGNTLTCYIKARAACRAAGLODCTMLVCGDDLWICESAGVO 
?.5S?S^S^^^^^^^^^^PP^^PPQP2^L2LITSCSSNVSVAHDGAGKR-2800 
VYYLTRDPTTPLARAAWETARHTPVNSWLGNIIMFAPTLWARMILMTHFF 
SVLI ARDQLEQALDCEI YGACYS lEPLDLPP I IQRLHGLSAFSLHSYSPG- 2 900 

G 

E I N R VAACLRKLX3 VP PLRAWRHRARSVRARLLARGGRAAI CGKYLFNWAV 

(P) 

RTKLKLTPIAAAGQLDLSGWFTAGYSGGDIYHSVSHARPRWIWFCLLLLA-3000 
AG VG I YLLPNRO- 3011 

Stop codon 



( ) - Heterogeneity due possibly 

to 5' or 3' terminal cloning 
artefact. 
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10 20 30 40 

EClO GAATTCGGACGACGCAAGGTTGCAATTGCTCTATCTATCCCGGCCATAT 
X::: ::!:::!:::::::::::::::::::::::::: :::::: 
HCVl CTCTCCCAGGCGCCACTGGACGACGCAAG6TTGCAATTGCTCTATCTATCCCGGCCATAT 
550 560 570 580 590 600 

50 60 70 80 A 90 100 

AACAGGTCACCGCATGGCATGGGATATGATGATGAACTGGTCCCCTACGACGGCGTTAGT 
■ * ' s ;:;:;:::::::::;;;:::::::;;;:•:::::; : . . 

AACGGGTCACCGCATGGCATGGGATATGATGATGAACTGGTCCCCTACGACGGCGTOXSGT 
610 620 630 640 650 660 

110 120 130 140 150 160 

GGTAGCTCAGCTGCTCCGGATCCCACAAGCCATCTIGGACATGATCGCTGGTGCTCACTG 

AATGGCTCAGCTGCTCCGGATCCC^^ 

670 680 690 700 710 720 

170 180 190 200 210 220 

GGGAGTCCTGGCGGGCATAGCGTATTTCTCCATGGTGGGGAACTGGGCGAAGGTCTTGGC 
: : ; : : J : ; ! : : : z ; : ; . . 5 . . . 5 . J . . . J , J ; 5 . . . . . . . , . . . . J . ^ 

GGGAGTCCTGGCGGGCATAGCGTATTTCTCCATGGTGGGGAACTGGGCGAAGGTCCTGGT 
730 740 750 760 770 780 

230 240 250 260 270 280 

AGTCCTGCTGCTATTTGCCGGCGTCGACGCGGAAACCCACGTCACTGGGGGGATCGCCGC 

AGTGCTGCTGCTATTTGCCGGCGTCGACG^ 

790 800 810 820 830 840 

290 300 310 320 330 340 

Cy^CTACGGCTAGCCTTACTGGTCTCTTCAATTTAGGTGCCAAGCAGAACATCCAGCT 

CCACACTGTGTCTCGATTTCTkGCCTCCT^ 

850 860 870 880 890 900 

350 360 370 380 390 400 

GATCAACACCAACGGCAGTTGGCACATCAACAGGACGGCCTTGAACTGCAATGATAGCCT 
::::::::::::::::::::..,,. : : : : • : : ; ; • j j s 

GATCAACACCAACGGCAGTTGGCACCTCAATAGCACGGCCCTGAACTGCAATGATAGC^ 
910 920 930 940 950 960 

410 420 

CAACACCGGCTGGAATTC 
::::::::::: :X 

CAACACCGGCTGGTTGGCAGGGCTTTTCTATCACCAGAAGTTCAACTCTTCAGGCTGTCC 
970 980 990 1000 1010 1020 

FIG,8 
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AA #117- 308 f putative envftlope reginn) FIG.9 

1) HOT #18 (USA) 3 clones sequenced 

2) JH23 (USA) ? ^ 

3) JH 27 (USA) ? 

4) PBL-Th (USA) 2 clones sequenced 

5) EC1 (Italy) 3 clones sequenced 

6) HCV-1 (chimpanzee) multiple 

I) (P) 

2) 
3) 
4) 
5) 

6)RNLGKVIDTLTCGFADLMGYIPLVGAPLGQAARAUHGVRVLEDGVNYATGNL 

1) H 

2) 

3) S T T 

4) L 

5} (F) s 

6)PGCSFSIFLLALLSCLTVPASAYQVRNSTGLYHVTNDCPNSSIVYEAADAILH 



1) (H) V V 

2) A D V V K T 



T 



3)S 



PVA N 



ART 

5) H V T 

5)TPGCVPCVREGNASRCWVAMTPTVATRDGKLPA-rc)LRRHIDLLVGSA-n.CS 

1) 

2) I D 

3) D 
4) 

5) I 

6) ALWGDljCGSVFLVGQLFTFSPRRHWrraGCNCSI 
SUMMARY: "S" AA117-308 (932) 

HCins, PBL-Th, ECl(Italy) have 97% homology with HCV-1 

JH23 and JH 27 have 96% and 95% homology with HCV-1, respectively 
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AA#3QQ'438 ( ^terminal region of the putative envalope region and 
amino -1/3 of NSI^ 



1) JH23 
2} JH27 

3) Japanese isolate (T. Miyamura) 

4) EC10 (Italy) 



5) HCV-1 (chimpanzee) 



1) D 
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2 Clones sequenced 

(one nt difference, wliich did not 

result In an amino acid change) 
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^) R ARSTA VA 

2) T YT N A R TQALT F 
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2) FT Dl 
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I R 

I R 
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5)LAPGAKQIWQLINTNGSWHLNSTALNCNDSLNTGWL 



SUMMARY: NS 1 AA 330-660 

"Isolate" ^Homology (AA330-438) 
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JH23 
JH27 

Japanese 
EClO (Italy) 



FIG. 1 0 



ZHomology (AA383-405) 
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